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SOME THEORETICAL RESULTS ON NEURAL 
SPIKE TRAIN PROBABILITY MODELS 1 

By Hock Peng Chan and Wei-Liem Loh 

National University of Singapore 

This article contains two main theoretical results on neural spike train models. 
The first assumes that the spike train is modeled as a counting or point process 
on the real line where the conditional intensity function is a product of a free 
firing rate function s, which depends only on the stimulus, and a recovery func- 
tion r, which depends only on the time since the last spike. If s and r belong 
to a g-smooth class of functions, it is proved that sieve maximum likelihood 
estimators for s and r achieve essentially the optimal convergence rate (except 
for a logarithmic factor) under L\ loss. 

The second part of this article considers template matching of multiple spike 
trains. P-values for the occurrences of a given template or pattern in a set of 
spike trains are computed using a general scoring system. By identifying the 
pattern with an experimental stimulus, multiple spike trains can be deciphered 
to provide useful information. 

1 Introduction 

In the field of neuroscience, it is generally acknowledged that neurons are the basic units of 
information processing in the brain. They do this by generating characteristic very short 
duration and highly peaked electric action potentials, or more simply spikes, in its body 
[see, for example, Dayan and Abbott (2001)]. These spikes can travel along nerve fibers that 
extend over relatively long distances to other cells. The temporal pattern of these spikes 
depends dynamically on the stimuli of the neuron or the biochemicals induced by the spikes 
of other neurons. The collection of such spikes generated by a neuron over a time period 
is called a spike train. In this way, information is transmitted via spike trains. Because 
the spikes are of very short duration and are highly peaked, point processes or counting 
processes are the most commonly used probability models for neural spike trains, with 
points on the time axis representing the temporal location of the spikes [see, for example, 
Brillinger (1992)]. 

Let N(t) denote the number of spikes on the interval [0, t). N(.) counts the number of 
spikes and hence is a counting process. Let w\ < W2 < ■ ■ ■ < itfzv(t) be all the spike times 
occurring in [0,i). We assume that the following limit exists: 

A(%i, • • • , w N{t) ) = lim -E[N(t + 5)- N(t)\wi, • • • , w N{t) ], 

A(.|.) is known as the conditional intensity of N(.). 

1 AMS 2000 subject classifications. Primary 62E20; secondary 62G20, 62M20. 

Key words and phrases. Boundary crossing probability, change of measure, conditional intensity, counting 
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In the neuroscience literature, a number of probability models for A(.|.) have been pro- 
posed. One of the simplest is when X(t\wi, ■ ■ ■ , w^m) depends only on t. This leads to an 
inhomogeneous Poisson process [see, for example, Ventura et al. (2002)]. It is well known 
that for a short period of time after a spike has been discharged, it is more difficult or even 
impossible for a neuron to fire another spike [see, for example, Dayan and Abbott (2001), 
page 4]. Such a time interval is called the refractory period. The main drawback with the 
inhomogeneous Poisson process model is that it does not incorporate the refractory period 
of the neuron. To account for this, a number of researchers [for example, Johnson and 
Swami (1983) and Kass and Ventura (2001)] have proposed modeling A(.|.) by 

X (t\wi, ■ ■ -,w N( t)) = f(t,t- w N( t)), 

where / is a nonnegative function. This model is Markovian in that it only depends on the 
present time t and the duration t — w N ^ since the last spike. 

A simpler alternative model for the conditional intensity of N(.) that has been proposed 
in the literature [see, for example, Johnson and Swami (1983), Miller (1985) and Berry and 
Meister (1998)] is 

m \ u\ ^ J s ^ if iV(t) = 0, 

(1) XlitlWU '"' Wm) = \s(t)r(t-w m ) XN{t)>l, 

where s,r are nonnegative functions, s and r are known as the free firing rate function 
and the recovery function respectively. This model has the added attractiveness of easy 
interpretability. 

This article consists of two parts. The first part considers sieve maximum likelihood 
estimation of s and r in (1) based on n independent realizations of N(t),t G [0, T), where 
< T < oo and N(.) is a counting process with conditional intensity Ai(.|.). Here we 
assume that the true free firing rate function s and recovery function r both lie in the class 
of g-smooth functions G>h,q where ®k,q is defined as in (3). Section 2 computes upper bounds 
on the metric entropy of Qn,q as well as other function spaces induced by &a,q- These results 
are needed in Section 3. 

Section 3 focuses on sieve maximum likelihood estimators s n and f n for s and r re- 
spectively. Assuming that there exists an absolute refractory period (that is, there exists 
a constant 6 > such that r(u) = 0, for all u G [0,0]), it is proved in Theorem 2 that for 
q > 1/2, 

fT 

E s ,r[ / \s n {t) - s(t)\dt] = 0{n- q l {2q+l ^ log 1 / 2 n), as n -> oo. 
Jo 

If, in addition, s(t) > for t G [0, T], then Theorem 2 shows that 

E s ,r[ / \r n {u) - r(u)\du] = 0(n- q ^ 2q+ ^ log 1 / 2 n), as n — ► oo, 
Jo 

where T* is an arbitrary but fixed constant satisfying < T* < T. 

In Section 4, corresponding lower bounds for the convergence rate are established. In 
particular under the assumptions of Section 3, Theorems 3 and 4 prove that it is not possible 
to achieve a faster convergence rate than n^ q ^ 2q+1 \ Thus we conclude that sieve maximum 
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likelihood estimators for s and r achieved essentially the optimal convergence rate (except 
for a logarithmic factor). 

The second part of the article deals with the detection of multiple spike train patterns. 
Let = {wi \ . . . jwjv^m} be the spike times of the ith neuron for 1 < % < d, and let 

w = (w^, . . . , w^). Loosely speaking, the pattern or template w is said to have occurred 
at time t in the spike trains y = (y^, . . . , y^) if for most y G yW n [t, t + T), 1 < i < d, 
there exists w £ such that y — w — t is close to 0. A more rigorous definition of a 
match, via a user-chosen score function, is given in Section 5. When the number of matches 
is significantly large, we can identify the onset of the patterns w in y with the stimulus 
provided to the subjects when w is recorded. For example, w can be the spike times of 
an assembly of neurons of a zebra finch when its own song is played while awake and y is 
the spike trains of the same assembly when it is sleeping. The replaying of these patterns 
during sleep has been hypothesized to play an important role in bird song learning [cf. Dave 
and Margoliasch (2000) and Mooney (2000)]. 

In Brown, Kass and Mitra (2004), it was stated that "research in statistics and signal 
processing on multivariate point process models has not been nearly as extensive as re- 
search on models of multivariate continuous-valued processes" in the section titled "future 
challenges for multiple spike train data analysis" . We develop in Sections 5 and 6 a theory 
for computing the distribution of scan statistics in multivariate point processes and apply 
it to obtain p-values for template matching. The accuracy of these computations is then 
verified independently via computer experiments. 

2 Metric entropy 

In this section, suppose N(.) is a counting process with conditional intensity Ai(.|.) as given 
by (1). We assume that a realization of TV is observed on the interval [0, T), < T < oo, 
and that the spike times are < w± < W2 < ■ ■ ■ < w N ^ < T. It is convenient to let 
{wi, ■ ■ ■ ,Wj\[(t)} denote the point process corresponding to N(t),t £ [0,T), and J\f be the 
set of all possible realizations of {wi, • • • , itf?v(T)}- It follows from Chapter 7 of Daley and 
Vere-Jones (2002) that the likelihood is the local Janossy density given by 

N(T) 

(2) PsA{^---,w N(T) }) = e-fo<t)r(t- u , Nit) )dt -Q s{wj)r{wj _ Wj _^ 

i=i 

and hence its log-likelihood function is 

nT N ( T ) 

l(s,r\{w!,- ■ ■ ,w N( T)}) = - s(t)r(t - w N u))dt + ^ log[s(wj)r(wj - Wj-i)] 

J ° 3=1 

w 1 AT t-T 

s(t)dt — j s(t)r(t — W]y(t))dt 

JwiAT 

N(T) 

+ Yl l °S[ a (. w j) r (. w 3 ~ w J~l)i 
3=1 
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where r(t - w ) = 1 for all t G [0, T). 

Next let q, qo, q\ be constants satisfying q > 0, q = qo + q±, qo a nonnegative integer and 
< gi < 1. Furthermore we write k = («o, ■ ■ ■ , ^ ?0 +i) to be a vector of strictly positive 
constants. In this section, we assume that the true free firing rate function s and the 
recovery function r lie in the (/-smooth function class ®k,q where 

r d? 
©R,5 = {/ = 9 2 ■ 9 e C«> [0, T),mm ^(t) > 0, max | ^(t)| < Kj ,j = 0, • • • , q , 

d qo d qo ^ 

(3) \—g(t 1 )-—g(t 2 )\< Kqo+1 \t 1 -t 2 r,Vt 1 ,t 2 e[0,T)y 

Let {0 < 5 n < 1 : n = 1, 2, ■ • •} be a sequence of constants (to be suitably chosen later and 
S n depends only on n) such that 5 n — > as n — > oo. We define a sieve for the parameter 
space of Q^q by 

&k, q ,n = \f = g 2 :geC qo [0,T), min > S n , max |— #(i)| < Kj , j = 0, ■ ■ ■ , q , 
I te[o,T) te[o,T) dP 

d qo d qo i 

Let @k,q and &k,q,n be endowed with the metrics pQ k and pe s , „ respectively where 

Pe- K , q {h,h) = sup |/J /2 (t)-/ 2 1/2 (t)|, V/i,/ 2 €e^, 
te[o,T) 

Pe s ,„„(/i./2) = sup |A 1/2 (0-/2 1/2 (*)l> V/l,/2€0^ )n . 
te[o,T) 

We observe that any / G 0«, ? can be approximated arbitrarily closely by (f 1 / 2 +6 n ) 2 G ©K, 9 , n 
by choosing n sufficiently large. Consequently a sieve for the parameter space of (s, r) can 
now be expressed as ©§ _ „ = 0« a n x 0^ n with metric p ft 2 where 

Pel qn ((fi,9i),{f2,g2)) = PQa^Mu h) +Pe*, q ,n (91,92), V(/i,5i),(/ 2 ,5 2 ) g ©l i?;n . 
Next let 

^■«, 9> n = {p«i,n is as in (2): (si,n) G ©| ?jn }, 
be endowed with the Hellinger metric p^- n where 

PFn, q ,n (Psi ,r\ , Ps 2 ,r 2 ) 
= ||„l/2 _ p l/2 II 

OO „ ^ 

= IS/ ^^({^b • ■•.»?})- Pl 2 /2 r 2 ({wi. • • • , Wj})] 2 ^l • • • d^j . 

j=0 J0<i»i<-<Wj<T J 

For e > 0, let ©1 „ „ (e) C @| „ „ denote a finite e-net for @| „ „ with respect to the metric 
o ft 2 . This implies that for each (si,ri) G 6? there exists a (59,^0) E ©i^^fe) such 
that ,002 ((si,n), (s 2 ,r 2 )) < e. 



SPIKE TRAIN MODELS 



5 



Now suppose that for each e > 0, there exist measurable nonnegative functions fi e and 
f u>£ on ©|g in (e) x N such that for each (si,tt) G 0| )?) „, there is some (s 2 ,r 2 ) G ©l^nOO 
satisfying 

( 4 ) Pe? ((-5i,ri),(s2,r 2 )) < e, 
with 

/i,e((s2,r 2 ),{«;i,-",«'j V ( T )}) < p si ,n({wi,--- ,wn(t)}) 

(5) < / u , e ((s2,r 2 ),{u;i,---,u; iV ( T )}), a.s., 

and 



oo „ 

{E / [/«i 2 ((*2,^), {«;!,■■■, 

j=0 ■'0<iui<-<w J -<T 

1/2 2 1 

(6) ((s 2 ,r 2 ),{wi,---,Wj})] dw! ■ ■ ■ dwj j < e. 

Definition. For e > 0, the e-entropy of ®\ qn with respect to the metric p@2 is 
defined to be 

#( £ ' L,n>Pe| 9 n ) = log[min{card 9| g n (e): 9| g n (e) is a e-net for ®\ q n 

with respect to the metric p S 2 }] . 

k,q,n 

The e-entropies of &a,q with respect to pe s , and @k,q,n with respect to pe s , „ are defined 
in a similar manner. 

Definition. The e-entropy of Tk,,q,n with bracketing with respect to the metric pp h n 
is defined to be 

H B (e,T~ K ^ n ,pr Rqn ) = log[min{card ®\ q>n {e): (4), (5) and (6) are satisfied}]. 

We observe from Kolmogorov and Tihomirov (1961), page 308, and Dudley (1999), page 
11, that the e-entropy of &a,q,n satisfies 

C~ 

tf(e,9*,5,n,Pe s ,,,J < H(e, 0«, g ,pe s ,,) < 

and hence 

(7) i^e?^,^ j < 2H(-,e k , q , n , Pe ^ n ) < £l/q K ' g , 

where C%,q is a constant depending only on k and q. Thus we conclude from Lemma 10 in 
Appendix A that 

where C« is a constant that depends only on k. 

Next let / : N — ► i? be a nonnegative function such that 

oo „ 

/ /({u;i, • • • , Wj})dtfi • • • dwj < oo. 

j=0 J0<wi< - <Wj<T 
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We follow Wong and Shen (1995) by denning 

Z f ({w 1 ,---,w N{T) }) =log - iT ' 

where s is the true free firing rate function and r the true recovery function. For r > 0, we 
write 

f({w u - ■ -,w n( t)}) 

f{{wi,--- ,wn(t)}), if /({wir--,%(T)}) > e- T p S)r {{w l ,--- ,w N(T) }), 

e~ T p s ,r({ w ir ■ ■ ,w N ( T )}), if /({wi,---,WjV(T)}) < e~ T p s , r ({tt;i,---,w A r( r )}), 

and 

2"/, if > -r, 



(8) Z f = Z f - , 

w J / [ -r, if Z/ < -r. 

Let Zf- t q, n = {Zps 1 .r 1 '■ Psi,n G ^^,11} be the space of truncated log-likelihood ratios (based 
on one observation). Define iT B (e, Zk,q,n, Pz- ) t° be the e-entropy of Z k ^ n with brack- 
eting with respect to the metric 

P Z ii}q<n ( Z Ps 1 ,r 1 ^ Z p a2 ,r 2 ) 

= {^ s ,r[^ sl , ri ({^i,---,^iV(r)})-^ S2 ,, 2 ({^i,---,^iV(T)})] 2 } 1 

00 „ 

= \ Yj [ Z Ps 1 ,r 1 (.{Wll---,W j })-Z ps2r2 {{wi,---,W j })\ 2 

j=0 J0<wi<"-<Wj<T 

xe-fo^yit-vmWqjsiwiyiwi - wi-^dw! • ■■dw j y / \ 
1=1 

where £(t) = maxjfc > : < t} and E sr denotes expectation when the true free firing 
rate function is s and the recovery function is r. We observe from Lemma 11 in Appendix 
A that 

0) H B (s,Z k , q , n , P ^ J < 2^y«C^C k ,/-^)^. 

3 Sieve maximum likelihood estimation 

In this section, we assume that we have n independent identically distributed copies of 
N(t),t G [0, T), with conditional intensity as given by (1). Let these n copies be denoted 
by Ni(t),t € [0, T), and the spike times be written as < w^i < •■■ < w iN .(r) < T, 
i = l,---,n. Inspired by Wong and Shen (1995), we shall first establish a number of 
likelihood ratio probability inequalities. 

Proposition 1 Let < e < 1 and Ca,Ca tq be as in (9). Suppose that 
(10) 0<, + ^c!'«C,,(^)V^<^. 



SPIKE TRAIN MODELS 



7 



Then 



?t Psundwi,!, - • • ,U>i,Ni(T)}) . _ n£ 2 



ne 



2 



P* r is the outer measure corresponding to the density p s r . 

We refer the reader to Appendix A for a proof of Proposition 1. 
Next we define nonnegative functions s„ and rt on t G [0, T) by- 



Ill) ^/s], (t) = VX*) + *n, V r «(*) = VMO + <*n, 

where {0 < S n < 1 : n = 1, 2, ■ ■ ■} is as in Section 2. Since s, r G Ok j(? , we have sJi, rjj G 0K, ? , n 
and pt j £ o n for sufficiently large n. We further observe from Lemma 8 of Wong and 
Shen (1995) and Lemma 14 in Appendix A that 

< Si := E s , r (^- - 1) < C kA 6 n , 

where Ca,i is a constant depending only on h. 

Proposition 2 Let < £ < 1,<% < 1 and that (10) holds. Then 

p;a sup n Psi ' ri( ^' l, ^ , ^ (T) !! > e- 2 / 16 } 

- 4eXp[ "2^250) ]+eXp[ " n( 16"^ )] - 
Proof. First we observe from Proposition 1 that 

p* r .„„ -TT ^i,n({^,ir--,^,jV I (T)}) p -„ £ 2/16l 

s > ri sup 11 ...,„. „ /m a1 - e 1 



1/2 _ 1/2 1| ^ _ 



=iP-J„rt({ w i.l."-,«' i)Jv . (T) }) 



< P; r { sup n^^^'-'-'^^^e-- 2 / 8 } 

+P S r{ f[ P ^ 1 '-"' W ^>\ > e - 2 /i6 } 



2 



A Ps,r({^i,l,- ■ ■ ,Wi,Ni(T)}) 



< 4exD r r TT ^.ra^,i. l^VW > c "^/16l 

* 4eXp[ 2 7 (250) ]+P - { np 4)4 (K 1 ,---,^ i( T)})- e ^ 

Now using Markov's inequality, 

p rTT ^(Kv^Wr)}) > ^/le-. 



SPIKE TRAIN MODELS 



8 



= (1 + 4 )"e-" £2 / 16 



77- 

This proves Proposition 2. □ 
Definition. Let r] n be a sequence of positive numbers converging to 0. We call an 
estimator p§ n ,r n '■ N n — »■ R + a ?? n -sieve MLE of p s>r if (s„,r n ) G 0~ j9jTl and 

j n \ 11 

-5^1og[ps„,f„({«;i,l,- ■ ■ jUfyV^T)})] > SUp -J^loglPaLnKwi.l.-'-^^^T)})]-^. 

i=l Ps 1 ,r- 1 £^ r ft,g,n j =1 

The corresponding s„ and f n are called ??„-sieve MLEs of s and r respectively. 

Definition. If <5 n in Section 2 satisfies <5 n = for n = 1, 2, ■ ■ -, then a 7? n -sieve MLE 
Ps n ,r n is more simply called a r/„-MLE of p Sjr . 

Proposition 3 Let e n > be the smallest value of e satisfying (10) and < 7] n < e^/16. 
tfPs n ,r n is a Vn-sieve MLE of p s;r , then 

PsA\\p/X ~ A 2 h > e n ) < 4exp[-^||y] + exp[-n(| - ft 

where P S:T denotes probability when the true free firing rate function is s and the recovery 
function is r. 

Proof. We observe from Proposition 2 that 

PsA\\pZ n -p^h>e n ) 

< P: r { sup A ^({^ir--,^(T)}) > e _ nnn} 

/ D * r TT Psi,ri{{Wi,l, ■ ■ ■ ,W iN (T)}) _ n£ 2 /16 



\\Ps[ 



2 2 



£ 4 ^i-?(2io)i + ^i-"(i|-*i- 

This proves Proposition 3. □ 

Theorem 1 Let e n > be the smallest value of e satisfying (10), q > 1/2 and < rj n < 
el/16. Ifps n ,f n is a r} n -MLE ofp s , r , then 

E sA\p\'X -Plirh = 0(n^/^ +1 )), asn^^. 
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PROOF. We observe that b\ = for n = 1, 2, ■ ■ ■ (from the definition of ry n -MLE) and e n is 
exactly of order n~ q ^ 2q+1 ^ as n — »■ oo. Hence we observe from Proposition 3 that 

E sA\vJX-p\{?h < en + 8exp[-^|| y ] + 2exp[-n(|-4)] 
= 0{n~ q ' {2q+l ">), 

as n — > oo. □ 
Now we assume that there exists a refractory period in which the neuron cannot dis- 
charge another spike after a spike had been fired [see, for example, Brillinger (1992) and 
Johnson and Swami (1983)]. More precisely, we suppose that there exists a constant 6 > 
such that 

(12) r(u) = 0, Vue[O,0]. 

Then the number of spikes on the interval [0, T) can be at most n$ = \T/0~\ . 

Proposition 4 Let e n > be the smallest value of e satisfying (10) and < rj n < e^/16 < 
(1 — e~ 1 ) 2 /32. If (12) holds and ps n ,f n is a 7] n -sieve MLE ofp Sjr , then 



j- =0 "'0<TOl 



Ps,r({wi, ■■■ ,Wj}) 
<-<W4<T ~Pw„(|Wl,--- ,Wj}) 



21o£f2) e K A (R A +l)T/2 

> [ 6 + (i_e-i)2 +8max{l,log( - ^ )}}e 2 n \ 



( 

* 4exp [-2w ]+exp[ - n( i" 4)] ' 

where R = no V 1. 
Proof. First we observe that 

f P 2 s,r({ w ir-- ,Wj}) 

/ , / 77 TT"^! " " " dw i 

^ Jo< W1 <-< Wj <T Ps n ,r n ({wi, ■ ■ ■ , Wj}) 

= ±f '- a f'" M '-""'*nt.» 3 («.)^-^.)^... A „ 

^ h<w 1 <...<w i <r e ~ /o *»(t)Mt-to C («))dt flj =1 s n (wi)r n (wi - Wi-i) 
< ^— s • 

Now we observe from Theorem 5 of Wong and Shen (1995) that 

\\p]iX-Pl!r\\ 2 2<el<(l-e-^/2 

implies that 

/ JV({u>i, • • • , Wj}) log 77 "-^rzldw! ■ ■ ■ dw 

j _ J0<w 1 <-<w j <T Ps n ,f n ({w 1 ,---,Wj}) 



3 



k 4 (k 4 +1)T/2 



21ogf2 N ) e' 
^ [ 6+ (l-e-i)2 +8max ^ lo g(-^ 
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Hence it follows from Proposition 3 that 
f ne f 

Ps,r\ ^2 / Ps,r({ w l> ■ ■ ■ , Wj}) log[ ^''^p' ^ JJ ^ ]dwi ■ ■ ■ dlO 

1 J=0 J0<ioi<--- 

21 g(2) « 4 (« 4 +l)T/2 



Ps,r({^l,---,^j}) 

77 7TJWUV1 • • • ttu/, 

<tl),-<T PWnd^ir-.U'j}) 



> [6+ (1 _ ^/ )2 +8max{l,log( e )}]4} 



* 4exp [-^io)] +exp ^|-^- 

This proves Proposition 4. □ 
The following theorem is the main result of this section. 

Theorem 2 Let e n > be the smallest value of e satisfying (10), q > 1/2, < r] n < £ 2 /16 
and 5 n = n~ a for some constant a £ (2q/(2q + 1), 1). Suppose that (12) holds and s n ,f n 
are r] n -sieve MLEs of s,r respectively. Then 

fT 

E s , r [ / |l„(t) - s{t)\dt] = 0{n- q ^ 2q+1 ^ log 1/2 n), as n -> oo. 
//, in addition, s(t) > /or a// i G [0, T], i/ien 

£ s , r [ / \f n (u) - r(u)\du} = Oin-t'to+V log 1 / 2 n), as n — ► oo, 
■/ o 

where T* is any constant satisfying < T* < T. 

PROOF. First we observe from (10) that e n is exactly of order n~ q ^ 2q+1 ^ as n — > oo. We 
observe from Proposition 4 and Lemma 15 (in Appendix A) that 



P s , r { min{ 1 , -L}[ / T \s n (t) - S (t)\e-Io^ dt f 

21rW2l p k 4 (k 4 +1)T/2 
— [6 + ^ _ e -i^2 8max{l, log 



?7.£ 2 £ 2 



or equivalently, 

iV{ j T \s n {t) - s{t)\e~ti s{u)du dt < |max{20 J* s{t)e~ s ^ du dt, 200} 

2W2) s 4 (k 4 +i)t/2 ,1/2, 

x[6+ (1 _^/ )2 +8m fl x{l,log( ^ )}]6 2 } } 



> l-4exp[-^]-«p[-»4-«t)]. 
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This implies that 



E s , r [[ \s n (t)-s(t)\e-ti s M du dt] 
Jo 

< |max{20^ s{t)e~ & s( - u)du dt, 200} 



21og(2) e s 4 (s 4 +i)T/2 i /2 

x[6+ (1 _°l^ 2 +8max{l,log( £5 2n B Wn) 



2 2 



+K 2 r{4ex p [ ___|_ ] +exp[-n(^ -4)]}, 



and consequently 



£ a ,r[/ |l„(t)-s(t)|rft] 
JO 

21og(2) M , r e^ 4+1 ) T / 2 ni 2 -, 1/2 



< { max{20^re 2 ^,200e^ r }[6 + + 8max{l,log[ e ]}]4 



J 



(13) + K 2 Te^{4exp[-^|| y ] + exp[-n(| - # )]} 

= O^-^/^+^log 1 /^), 

as n — > oo. Next we assume, in addition, that s(t) > for all t G [0, T]. Let £(t),t G [0, T) 
be as in (86). Since s G @k,q and 

s(t)e- -fo < £(t) < max{a(t),s(t)r(«) : « G [0,T)}, 

we have < mino<t<T^(i) < maxo<t<T £(i) < Thus as in the previous case, 



p 

1 s.r 



/ • r i i , 

20 Jo T Jo W - u)s{t)r(u)e- £-« ^^"^cit ' 200 



x [ / / |s„(t)r n (u) - s(t)r(u)|£(* - u)e~ £-« '("M^+^dudt] 1 



JO 



2l0fff2l e S 4 (S 4 +l)T/2 

< [6 + ^f^s + 8max{l,log(— 



* 1 - 4exp [-2W ] - exphn( i-^ )] ' 



or equivalently, 

r-T /-t 



Ps,r{ ^ ^ |$„(*)r n («) - s(t)r(u)|£(* - u)e-^ s ^ v - t+u)dv dudt 
< |max{20 J* j\{t - u)s{t)r{u)e~ £-« ^^-^"^dudi, 200} 

21offf2) e S 4 (K 4 +l)T/2 -,l/2-, 

x[a+ (r^ +8max{1 -' os( e „g- )}|e -} } 



£ 1 - 4exp [-2wi- exp i-"(it-*!.^ 
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This implies that 

rT r t 



< 



EsA [ I \s n (t)r n (u) - s(t)r(uMt - u)e~ »(«M«-t+«)* ' dudt) 
Jo Jo 

I max{20 J J* £(t - u)s(t)r(u)e~ £-« s{v)r{v - t+u)dv dudt, 200} 

21oef2) e «; 4 (« 4 +i)T/2 1/2 

[6 + ^ _ e -±y ^ { ) °§ 

( " S 2ne 



+K 8 T 2 {4exp[-^^] + exp[-n(^ - 4)]} 



r 2k>g(2) e /« 4 (/c 4 +l)T/2 . i/ 2 

< { max{20R 8 T 2 , 200} [6 + + 8max{l,log( )}]4 } 



(1 _ e -i )2 ' — ^ 



+K 8 r 2 {4e X p[-^g^] + exp[-n(^ - 4)]}, 



and 



E s,r[ / |«n(*)^nH ~ s(t)r(u)\dudt] 

Jo Jo 

r max{20At 8 T 2 e 2s4T ,200e 2s4T } r ^ 21og(2) M , e « 4 (« 4 +iW2 . i 

* { minora*) — - [6+ (r^+ 8m ^ 1 '^(^^)^} 



^8t2 k 4 T 2 2 

= Ofn-'/^+'llog'^n), 
as n — > oo. Hence 

0<t<T 



mh^8(t)]E air [J \f n {u) - r(u)\{T - u)du] 

< E Str [[ f s{t)\r n {u) - r{u)\dtdu] 

Jo Ju 

< E s , r [ \s n (t) - s(t)\ f n (u)dudt] + E s , r [ / \s n (t)r n (u) - s(t)r(u)\dudt] 

Jo Jo Jo Jo 

< K 2 TE s , r [[ \s n {t) - s{t)\dt] + E s , r [ [ f \s n {t)r n {u) - s(t)r{u)\dudt\. 

Jo Jo Jo 

Thus we conclude from (13) and (14) that 

E s , r [ \f n (u) - r(u)\du] 
Jo 

< — . T^EsA \s n (t) - s(t)\dt] 

(T - T*) mm < t <T s{t) J 



+ (T jZyZ^Z 7(t\ Es ^l I \sn{t)r n {u) - s{t)r{u)\dudt\ 

[1-1 ) min < t <T s[t) J J 

0( „-,/(2,+l) l og l/2 n)) 
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as n —> oo. This proves the theorem. □ 

4 Lower bounds 

Suppose N(.) is a counting process with conditional intensity Ai(.|.) as given by (1). Let 
iVi(.), • • • , N n (t), t G [0, T), be independent identically distributed copies of N(t),t G [0, T). 
In this section we shall compute lower bounds on the rate of convergence of an estimator 
for s and an estimator for r based on Ni(t), • • • , N n (t),t G [0, T). 
Let < 6 < T be as in (12) and q be as in Section 2. Define 



f d? 
®e,k,q = {f = 9 2 --geC^T), min g(t) >0, max |^(*)| < «j, j = 0, ■ ■ ■ , q , 



\-^9(h) - ^9(t2)\ < K qo+ i\ti-t 2 \ qi ,Vh,t 2 G [0,T),g(t) = if t G [0,0]}. 

Lemma 1 Lei @n,q,n Q ®k,q such that card(BK,<?,n) < oo. Suppose that s n is an estimator 
for s based on Ni(tj, ■ ■ ■ , N„(t), t G [0, T). T/ien ' 

sup{£ S)r [ f \s n (t) - s(t)\dt] : s G 0*,,, r G 0e,s l(? } 
■/ o 

> ^inf{/ |si(i) -s 2 (t)|di : *i + s 2 ,s 1 ,s 2 G e«,,, n }{l - — —I -r[log2 

2 Jo L log[card(eK 5 g in ) - 1] L 

( } + [card(e„, n )]2 ^ ^ ^ bg p S2 , ri (Kl • • • , w hNtiT) }) \ / ' 



/or any n G ®e,k,q- Next suppose that r n is an estimator for r based on Ni(t), ■ ■ ■ ,N n (t),t G 
[0, T). Let T* be a constant satisfying 9 < T* < T and &e,T*,k,q,n C @e,k, q such that 
caid(Qo,T*,k, q ,n) < oo and n(u) = r 2 (u),u G [T*,T) Wi,r 2 G &e,T*,k, g ,n- Then 



rT* 

sup{.E s , r [ / \r n {t) - r(t)\dt] : s G r G 6*,*,,} 

JO 

1 /- T * 

x(l ^ flog 2 

L log[card(G eiT *,K,g,n) - 1] L 

fie) + I V Te i og ^(Ki.-.w}) ii 

[card(e e , T .,, i(? , n )] 2 jf- ^ S1 * n p Sl , 2 (Ki,-,^,(T)})Jj' 



/or any si G 8^ 



We refer the reader to Appendix A for a proof of Lemma 1. Theorems 3 and 4 (below) are 
the main results of this section. They are motivated by the lower bound results in Yatracos 
(1988). 
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Theorem 3 Let q > 0. Suppose that s n is an estimator for s based on Ni(t), • • • , N n (t),t £ 
[0, T). Then there exists a constant C k , q > (depending only on k and q) such that 

sup{E s , r [ / \s n (t) - s(t)\dt] : 8 G @fi,q, r £ Q e , k , q } > C R , q rr"l^ +1 \ 
Jo 

Proof. Let {b n > : n = 1, 2, • ■ ■} be a sequence of constants that tend to as n — > oo 
and that b^ 1 is an integer. For i = !,■■■ , b" 1 , define 0j j7l : [0, T) — > R by 



(6n r)5[l _ (ffiM)^, if (i - l) bn T < t < ib n T, 

0, otherwise. 



Writing q = qo + qi, qo & nonnegative integer and < q\ < 1, we have 

lim max \-rn<t>i,n{t)\ < oo, Vj = 0, • • • , q , 

d qo d qo 
lim max I- — 4>in(ti) ; — 4>in(t2)\l\ti — t2\ qi < oo. 

Let r, a ,n denote functions of the form 

a[l + ^ 7i ^ n (t)] 2 , Vt€[0,T), 

i=l 

where 7$ = or 1 and a > is a suitably small constant such that H a>n C 6^. If 
si,S2 S H a>n where si / S2, then writing 

(17) ai(t) = a[l + ^ 7M ^ iri (t)] 2 , s 2 (t) = a[l + 72, 4 <Mt)] 2 , 

i=i i=i 

with 71,1,72,1 taking values or 1, we have 



i 1,-1 

0„ 



/ - s 2 (*)|<tt = a |2^(7 M -7 2il )^ n (t) + ^( 7liJ -7 2ii )0 2 in (t)|dt 

> ay [20 ljn (t) + 2 in (t)]dt 

where 

(18) J,= j (l-y 2 ) l dy > 0, V/ > 0. 

Also it follows from (17) that 

| Sl(t) "f (t) | < 2a0 iin (t) + a0 2 n (t) 

< 2a(k n T)" + a{b n T) 2 \ Vt€[0,T). 
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Let r\ G ®o,k,q- Now using Lemma 15 in Appendix A, 

rPsuniiwi,!, - ■ ■ ,Wi,Ni(T)})-, 



E Sl , ri log 



Ps 2 ,n({Wi,l, - ■ ■ ,Wi,Ni(T)}) 

= A^-l-lo g [^]} Sl (t)e-/o-H^ 
Jo s n*J s h*J 

+ T r{^-l-log[^]K(t-^) S i(t)ri(n) e -^-^^ + ^^ 
2 Jo -si (*) 

1 /- T /- f ( 8 i(t) - S2(t) )2 ^ t _ u)si{t y l{u y-Sl u s,{v)r,{v-t + u)dv dudt 

2 Jo Jo siit) 

< a2{b f )29 [2 + (b n Tyf(l + R s T"), 



where k = kq V 1. Finally we observe from Proposition 3.8 of Birge (1983) that there exists 
a subset @k,q,n of ^ a ,n such that 

[ T \si(t) - s 2 (t)\dt > ^-[a{b n TY +1 J q + a(bwr) 9 2 ' +lj29 ], V Sl + s 2 G e R ,,, n , 

Jo ° n ^ 

and log[card(G>K )(?) n) — 1] > 0.316/6 n . Consequently we conclude from (15) that 
sup{£ Sjr [ J \s n (t) - s(t)\dt] : s G 0«, g ,r G 0e,«, g } 

6™ r bg2 + ^^ [2+(6jmi + _ 8r2) jj 



0.316 



16 



Thus we conclude that there exist strictly positive constants Co and C^ q (depending only 
on R and q) such that by taking b n = l/\Con 1 /^ 2q+1 ^ >1 \ , we have 

sup{£ s , r [ / \~s n {t) - s(t)\dt] : s G 9^re Q e ,k,q} > C*,,n _,/(2,+1) ■ 
Jo 

This proves Theorem 3. □ 

Theorem 4 Let q > and 0, T* 6e constants satisfying < 9 < T* < T. Suppose that 
f n is an estimator for r based on Ni(t), ■ ■ ■ , N n (t), t G [0, T). Then there exists a constant 
Ce,k,q > (depending only on 6, h and q) such that 



sup{£ s , r [ / \f n (u) - r(u)\du] : s G @k, q , r G @e,k,q} > Q,*,, 
Jo 



(fl -5/(29+l). 
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Proof. Let / e &e,k, q such that f(u) > if u € [6>*,T*] for constants 6>*,T* satisfying 
< 6> < 9* < T* < T. Then < / := min ue [ e . jT *] f(u) < f := max Mg [ e * T „] f(u) < oo. Let 
{b n > : n = 1, 2, ■ ■ ■} be a sequence of constants that tend to as n —> oo such that 6" 1 
is an integer. Then for i = 1, ■ ■ ■ , define 0j j7l : [0, T) — > E by 

[ [b n (T* - 9*)}i[l - ( 2M ' 2r ^y' r) ) 2 ] g , 
&,n(«) = < if0* + (i - l)6 n (T* - 0*) <u<9* + ib n (T*-9*), 
I 0, elsewhere. 

Let 

^o.n denote functions of the form 

a[f 1/2 (u) + 5>i<M«)] 2 , Vn G [0,T), 
i=i 

where 7$ = or 1 and a > is a sufficiently small constant such that E a>n C @e,k,q- If 
77, r 2 G H a>n where 77 7^ r2, then writing 

n(«) = a[/ 1 / 2 ( u ) + ^ 7l) ^ iin ( ti )] 2 , 

i=i 

in' 

(19) r 2 (u) = a[/ 1 / 2 (7i) + ^ 72il ^ n (n)] 2 , 

i=i 

with 71,1,72,1 taking values or 1, we have 



/ |n (u) — r2(u)\du 
Jo 

,-T* fore b n 

= a / |2/V2( U ) ^( 7lj . _ 72>i )<M«) + £(7^ - l2 MM\ dt 
Jd * i=i i=i 

r e*+b n (T*-e*) 
> a / [2/ 1/2 <M«) + ^i ( U )| du 



2 

where J g is as in (18). Also it follows from (19) that 

Mu)-r 2 {u) < 2ap/ 2 cP hn (u) + a</> 2 » 
ri(it) ~~ f 1 / 2 



2afW[b n {T*-e*)]« a[b n (T* - 6*)] 2q „ _ _ _ 
< 7772" + JTf2 ' VnG[^,Tj. 



Let si £ ®R,q- Now using Lemma 15, 



E si ,n log[ 



Psi,r-2 ({^i,!' ' ' ' ' w i,Ni(T)Y) 
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J J _ i _ log[^j]R(i - «).i(i)n(i.) e -t. ,1 ^^*<fc«ft 

T { !^ _ i _ i og [^% r^(* " ^iWnWe-^ slWri( "- t+ ^to 

r lW r l( u ) in 



< 1 f T * - r 2 {u) . 



/"%(* - u)si(t)n(«)c-^-» 8l(t ' )ri(t '- t+tt)d, 'dtd« 



< 



K 8 6„ 9 (r - g^[2o/V^ - r) g + ab q n (T* - e*) 2 ^ 



where k = k,q V 1. Finally we observe from Proposition 3.8 of Birge (1983) that there exists 
a subset <S>e,T*,k, q ,n of E a ^ n such that 



/ | r i (it) — r-2 (it) | 

J 



> 



86 r , 



a/ 1 / 2 [6„(T*-r)] 9+i J,+ 



,+i , ^ a[b n (T*-9*)]^J 2q 



, Vri / r 2 € Qe,T* 



and log[card(€)g i r* ) K jn ) — 1] > 0.316/6„. Consequently we conclude from (16) that 

rT* 

sup{E SjT [ / \f n (u) - r(u)\du] : s £ 6^ ? , r £ @ e ,k, q } 

> f [ ,./- ( r-.-r J , / (r - , ; r ^ ]{ 1 



0.316 



log 2 + 



V 



}■ 



Hence there exist strictly positive constants C\ and Cg^,q (depending only on 6, k and q) 
such that by taking b n = l/[Cin 1/(2<?+1) l , we have 

-T* 

sup{£ s , r [ / |f n (u) - r(u)\du] : s G ft) „r G 9^} > Ce^n"^ 2 ^ 1 ). 
JO 

This proves Theorem 4. □ 



5 Template matching with continuous kernels 

In the second part of this article, let w = (w^, . . . , w^) be the spike train pattern of an 
assembly of d neurons recorded when an experimental stimulus is provided to a subject, 
where wW = {w±\ . . . , w^.^} are the spike times of the ith neuron over the period [0, T). 
The same neurons of the subject are subsequently observed for a longer period of time when 
it is engaged in other activities and the corresponding spike trains y = (y*- 1 ^, . . . , y^) are 
checked for occurrences of the template w. 

For t > 0, let y t = (y t (1) , . . . , y { t d) ), where y t W = {y - t : y G yW n [t, t + T)}. There are 
various algorithms in the neuroscience literature that have been used to determine if there 
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is a close match between y t and w. In Griin, Diesmann and Aertsen (2002), T is chosen 
small and a match is declared if 

{1 < i < d : wW = 0} = {1 < i < d : yf } = 0}. 

In the sliding sweeps algorithm [cf. Dayhoff and Gerstein (1983) and Nadasdy et al. 
(1999)], a match is declared if 

sup sup inf \y — t — w\ < A, 

where A > is a pre-determined constant. We shall study in this section the pattern 
filtering algorithm [cf. Chi, Rauske and Margoliasch (2003)], which uses a scoring system 
to measure the proximity between w and y^. 

Let / be a non-increasing and non-constant function on [0, oo) with /(0) > 0. The score 
between w and yt is given by 

d 

(20) S t = TS? ) , where S t (i) = T' 1 £ max f(\y - t - w\). 

For a given template w, define the kernel functions 

(21) ffwV)= max f(\u- w\) 1{ < u <t}, Vi = l,---,d. 



w 



6w(0 



Then we can also express 



The graph of against t is thus a normalized sum of the kernels g$ (y — •) over all y G . 
We declare a match between y t and w to be present when the proximity score St exceeds 
a pre-determined threshold level c. To prevent overcounting, a match at time t is declared 
to be new only if the overlap between the time interval [t, t + T) and the time interval of 
the previous new match is less than aT for some constant < a < 1. More specifically, 
let cji = inf{t : S t > c} and a j+1 = inf{t > <7j + (1 - a)T : S t > c} for j > 1. Then the 
number of new matches between the spike trains y over the time interval [0, a + T) and the 
template w is U a := sup{j : aj < a}; with the convention U a = if a\ > a. 

To prevent the occurrences of too many (false) matches when y is pure noise, the 
threshold level c has to be chosen reasonably large. For a large, there can be on the average 
more than one new (false) match between y and w. The Poisson distribution is often 
used for modeling U a to compute the p-value under such circumstances. For small a, the 
occurence of a single match would itself be rare and we can use the probability of having 
at least one match as the p-value. For this purpose, we study the scan statistics 



M a := sup S t , 

0<t<a 



and its dual, the time to detection 



V c := mf{t : S t > c}, 
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which we shall show to have asymptotic Gumbel and exponential distributions respectively. 
In this section, we consider / to be continuous on [0, oo) while in Section 6, we will consider 
the case in which / is not continuous. 



5.1 Main results 



Let yW, i = 1, ••-,(/, be independent Poisson processes with constant intensity Aj > 0. 
Consider the following regularity conditions on w and /. 

(Al) Let w^, • • • , be point processes on [0, oo) with each ergodic, stationary and 
having non-constant inter-arrival times and let = PI [0, T). 

(A2) Let / be continuous and let there be a possibly empty finite set H such that the 
second derivative of / exists and is uniformly continuous and bounded over any interval 
inside ]R + \ H. Moreover, 

d 



(22) 



< sup 



dx 



f(x) < oo and lim f(x) > — oo. 



Let |U W = T 1 J2i=i Jo 9-w ( u ) du be the expected value of St conditioned on w known. 
Let the large deviation rate function of St be 



(23) 



</> w (c) = sup 

6»>0 



9c -T 



d f T n 
1 V Xi / {e 8g ™ {u) - 1) du for c > /i v 
J o - 1 



We shall denote by # w (= # w ,c) the unique value of 9 > that attains the supremum on the 
right hand side of (23). By the stationarity of w*^ in (Al), for all the distribution 

of max wew ( t ) f(\y — w\) is equal to the distribution of Z± := max <jiCiii (i) /(IH)- Hence by the 



ergodicity of in (Al) and the bounded property of / in (A2), 



T 



T -i I e e 9 $(u) du ^ Ee ezi ag w>0 
J o 

and T _1 / g^){u) du — > E'Zj a.s. as T — ► oo. 
io 

Let jU = J2i=i XiEZi. Define the limiting large deviation rate function 

d 

(25) 0(c) = sup he - \(Ee dZi - 1)1 for c> /i. 



(24) 



0>o L 



i=i 



Let 0* (= 0* )C ) be the unique value of 9 > attaining the supremum on the right hand side 
of (25). Then by (23), (24) and (25), /U w — > //, W — ► pointwise on Qu, oo) and 6* w — > 0* 
a.s. as T — ► oo. Similarly, 



i=i 7o 



(26) 



i=i 
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both converge almost surely to positive constants as T — > oo. Let P w denote the probability 
measure conditioned on a known w. 

Proposition 5 Assume (A1)-(A2). Then for any t > 0, A > and c> fi, 

(27) P w { sup S u > c) ~ AC w e~ T0w(c) a.s. as T -► oo, 

t<u<t+A > 

where Cw = (2vr) _1 (r w /w w ) 1 / 2 . 

By pieceing together the local boundary crossing probabilities in (27), we are able to 
obtain the following results. 

Theorem 5 Assume (A1)-(A2). 

(a) Let c > /i. Then the distribution (conditional on w) of C w e _T<?iw ^V r c converges to 
the exponential distribution with mean 1 almost surely as T — > oo. 

(b) Let a — > oo as T — > oo such that (log a)/T converges to a positive constant. Let 
c w > A*w satisfy </> w (c w ) = (loga)/T. Then for any z£l, 

F w {6 w T(M n — c w ) — log Cw > ^} - ► 1 — exp(— e~ 2 ) a.s. as T — ► oo. 

(c) Let a — > oo as T — > oo sac/i i/tai (log a)/T converges to a positive constant. Let c 
(= ct) be such that ry w := a( w e~ T ^ m ^ converges to a constant rj > almost surely. Then 

(28) J p w {[/ a = fe}-e- r ' w ^^0 a.s. Vfc = 0,l,---. 

Remark. Theorem 5 can be extended to deal with the situation in which m > 1 trials 
are conducted, giving rise to m spike train vectors. If we hypothesize that the times of 
recurrence of the template are the same for the m trials, then the pattern filtering algorithm 
is most effectively applied by comparing w against a union of the m spike train vectors. If 
we hypothesize that the times of recurrence of the template is different for each trial, then 
we can compare w against each spike train vector separately and sum up the number of 
new matches for the m trials. The Poisson distribution can again be used to compute the 
p-values. Note that we do not require the vectors y to have the same intensity for the m 
trials. This has implications when one has a spike train y that is nonstationary but can be 
broken up into m segments such that each part has almost constant intensities as we can 
apply Theorem 5(c) separately on each of the m segments. 

Remark. In Theorem 1 of Chi (2004), it was shown [without the regularity condition 
(A2)] that 

lim T -1 log V c — ► 0(c) a.s. for all c > /i. 

The question of whether log V c = T^ w (c) + ©-(T 1 / 2 ) was also raised in a remark on page 
157. Theorem 5(a) provides a more precise answer; that log V c = T0 w (c) + Op(l). 
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5.2 Implementation 

We conduct a small scale simulation study in this subsection to test the finite sample 
accuracy of the analytic approximations in Theorem 5. An alternative to analytic approxi- 
mations is to compute the p-values p w := P w {M a > c} via direct Monte Carlo. However, as 
p-values of interest are often small, a large number of simulation runs is required for these 
estimations to be accurate. The computational cost is compounded when the time period 
[0, a + T) of yM is large. 

We introduce here an importance sampling alternative for the simulation of p- values. 
We use a change of measure argument, which is also used in the proof of Proposition 5, 
by generating y W from an inhomogeneous Poisson process. Analogous change of measures 
for computing p- values have been used in sequential analysis [cf. Siegmund (1976)], change- 
point detection [cf. Lai and Shan (1999)] and DNA sequence alignments [cf. Chan (2003)]. 

Let .Pfl.t denote the probability measure under which yW is generated as a Poisson point 

process with intensity r)i(v) = Xie® 9 ™ ( v ~ t ^ for each 1 < i < d. Note that g$ (v — t) = for 
v [t, t + T) and hence the change of measure occurs only for the generation of spikes in 
the interval [t,t + T). The likelihood of y^ under Pg it is given by 



L e ^)=^ V {-X l f\^^du) [J A 

2/ey t (i) 



(y-t) 



Hence the likelihood ratio 

d 



FT Le ' tiy U ] = ff exp \6TSP - Xi r ( e ^H _ i) du 

tALoMv <=i L Jo 

d „t 



(29) 



= exp \0TSt - V] Xi f (e 69 ™^ - 1) du 
i=i Jo 



In our importance sampling algorithm, we first select a small A > such that J := a/A 
is a positive integer. For each simulation run, we generate j randomly from {0, . . . , J} 
followed by y from Pe w jA- The estimate 



p 



c+i)[E 



3=0 



dP w 



(y) 



l {M a >c} 



(30) 



(J+ l)exp [^A, /V^M " 1) du] {X^THm^c} 

i=l J ° 7=0 



is then unbiased for p w . The averages of (30) over all the simulation runs is then the 
importance sampling estimate of p w . 

Example 1 . Consider the Hamming window function 



(31) 



/(*) 



i(l-/3) + i(l + /3)cos(f) if < i < e, 
-0 if t > e, 



with e = 5 ms and f3 = 0.4 [see, for example, Chi, Rauske and Margoliash (2003)]. 
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Table 1. Estimates of P- w {M a > c}± standard error. 



c 


Direct MC 


Imp. Sampling 


Anal. Approx. (33) 


0.017 


0.037±0.004 


0.0387±0.0019 


0.0383 


0.018 


0.024±0.003 


0.0237±0.0012 


0.0241 


0.019 


0.016±0.003 


0.0158±0.0008 


0.0149 


0.020 


0.009±0.002 


0.0095±0.0005 


0.0091 


0.021 


0.005±0.002 


0.0054±0.0003 


0.0055 


0.022 


0.003±0.001 


0.0033±0.0002 


0.0033 



We generate a template w over the time interval from to T = 500 ms on d = 4 spike 
trains, with interarrival distance X ms between two spikes on each spike train satisfying 

(32) P{X < x} = 1 - e"^- 1 ^/ 24 . 

This corresponds to an absolute refractory period or "dead time" of 1 ms after each spike 
in before the next spike can be generated. In our computer experiment, a total of 80 
spikes were generated on the four spike trains using (32). 

To compute the p-values using direct Monte Carlo, we generated 2000 realizations of 
y by using Poisson point processes with constant intensity Aj = 0.04 ms -1 on the interval 
from to a + T = 20 s. The proportion of times that {M a > c} occurs is taken as the 
estimate of p w . For importance sampling, 2000 simulation runs were also executed using 
the algorithm described earlier by choosing A = 0.2 ms. The following thinning method is 
used to generate the spike times in the interval [jA,jA + T), where the intensity of 

y(0 i S 

not constant under Pq^ja'- 

1. Let yj^ = {ui, . . . ,un} be generated on [0, T) as a Poisson process with constant 
intensity Xie 6 ™?^. 

2. Generate independent uniform random variables R\, ■ ■ ■ , i?^r on [0, 1] and let 

y f\= {y-f.ye yW n[t,t + T)}) = { Uj G yf : Rj < e e w[ 9 wK)-/(o)] }> 
For the analytic approximation, we apply Theorem 5(a), which gives us 

(33) P w {M a >c} = P W {V C <a} = l- exp(-aC w e- T ^ (c) ). 

We see from the results summarized in Table 1 that there is substantial variance reduc- 
tion when importance sampling is used. The analytic approximations have also been shown 
to be quite accurate, lying within two standard errors of the importance sampling estimate 
in all the cases considered. 

5.3 Proofs 

We preface the proofs of Proposition 5 and Theorem 5 with the following preliminary 
lemmas. We shall let [ - J denote the greatest integer function. Let Pe m j be the change of 
measure defined in the beginning of Section 5.2 and let Pg w = Pg^fl- 
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Lemma 2 Let t > and c > /i w . T/ien 

P w {S t > c} ~ (2vrt; w )- 1 / 2 w 1 r- 1 / 2 e- T ^ (c) a.s. as T ->■ oo. 

Proof. Let Egt denote expectation with respect to the probability measure Pet- Let 
It = [zt, zt + et) with e T = o(T-^ 2 ). Then by (23) and (29), 



P^iT^iSt-c) eI T } = E e „ 



dpy~ {Tl/2{St ~ c)elT} 



-T0 w (c), 



(34) 



= e ~—>E e ^ t 



™ w (c-S t )-, 

e 1 {Ti/2(5 t _ c)e / T } 



By similar computations, for any |/£K, 
(35) P w {^>c + y} = e- T ^( c )^ wit 



p T0 w (c-S t ) 1r 

e i {St>c+ J ,} 



< e -T0 w (c)-T6/ w2 /_ 



Under P0 Wj t, TS 1 ^ is compound Poisson with Poisson mean % = Aj f Q T e^^^du and each 
summand is identically distributed as g$(Ui), where Ui is a random variable on [0, T) with 
density (Xi/rn)e 6w9 ^^ . We note that 



E 6* 



J J 



"' u ) e ew 9 «(«) dn= ^ F X-e^M du 
d0 Jo 



Since 6* w maximizes the right hand side of (23), it follows that 

(36) E^tiSt] =T- 1 J2viEeJ$(U i )} = T' 1 ^ X t j\e e ^^ - 1) du 



c. 



Since a compound Poisson Y = Ylf=i Yj nas variance Var(Y) = (EN)(EY 2 ), it follows 
from (26) that 

(37) Y^,t{St)=T- 2 Y J ^ / [gtt(u)] 2 (\ t / Vl )e e ^^du = T~ 1 v w . 

i=i ^ 

By (36) and (37), T 1 / 2 (S' t — c) is asymptotically normal with mean and variance t> w . 
Hence by equation (5) of Stone (1965), 

(38) Pe m A Tl/2 (St - c) € M = (2^ w r 1/2 / e- z2 /(2-w) dz + 0T (i)( £r + T -i/2) a.s. 

as T — > oo, where ot(1) is a term not depending on et and zt- Let e^T 1 / 2 tend to slowly 
enough such that o t (1)/(e t T 1/2 ) -► 0. Then by (38), 



(39) P ewit {T 1 / 2 (^ - c) € 7 T } ~ (2™ w )- 1/2 / eT^ 2 ^ a.s. as T 



oo, 
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if I T is uniformly bounded. Then by (34), (35) and (39), 



P w {St > c} = < T^HSt -c)<(k+ l)e T ] + P W {S T > c + T' 1 ' 2 } 



k=0 

(27T Uw )- 1 /2 e -^w(c) 



/>oo 



and Lemma 2 holds. □ 

Lemma 3 Assume (A1)-(A2). There exists et = o{T~ 1 / 2 ) such that for all uniformly 
bounded intervals I\,t-,Ii,t of length et, 

(40) ~ (2»)- 1 (»wT„)- 1/2 ( / e-*< 2 "-)(( 2l )( / „-«§/<»■.> <fc 2 ), 
almost surely as T — > oo. 

Proof. By stationarity, we may assume without loss of generality that t = 0. Under 
P, w , the vector (TS®,T£S®\ x =oY is bivariate compound Poisson with Poisson mean 

f]i = \i Jq e 6 ™ 9 ™^ du and with each summand identically distributed as 

where Ui is a random variable on [0,T) with density (Aj/7/j)e ew9 ™ )(u ). By (A2), -^Sx^\ x =o 
exists almost surely. 

We shall now compute the means and covariances of 



d 



dx 



dx 



<#(«) 



under P# w . Since 

Eg. 

(i) 

and <7w is bounded, it follows that 



d (i) 
du 



u=Ui 



' r d (i) 



du 



<#(«) 



-(KMe^ie^^-e^W), 



(41) 



Eo 



d 
.dx 



x=0l 



T-^mEe 



i=l 



The bivariate compound Poisson (Y, Z)' = Y.f=i( Y j> z jY , where ( y i> ^i)'> ■ ■ ■ > ( y JV, 

are independent identically distributed summands conditioned on an independent Poisson 

random variable TV", has covariance matrix 



Cov 



E(N) 



( E(Y 2 ) E(YZ)\ 
\E(YZ) E(Z 2 ) J ' 
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It follows from the relation 



u v 



that 



u=Ui 



r 



o 



d 



du' 



{\/m)[e^gt\u) - 0-V 9 ~ (u) u T = 0{T- 1 ) a.s., 

u=0 



Cov flw (5 , ^S*U=o) = -r-^^^^O^^HI^] = 0(^ 2 ) a.s. 

1=1 

and hence by (26), 

,42, cOT »-(^iL)~ r " i ("o :) 

It then follows from equation (5) of Stone (1965), (36), (41) and (42) that 

Pe„A Tl/2 (St - c, ^S x \ x=t ) G h,T x I 2 ,t} 
= (2vr)- 1 ( Ww r w )- 1 / 2 ( / e-'V^dzAff e'^ 2 ^ dz 2 ) 
+0T (l)(e 2 T + T- 1 ), 

where oy(l) does not depend on IjT, 3 = 1> 2. Then Lemma 3 follows by selecting et such 
that stT 1 / 2 -> and o T (l)/ef,T 0. □ 

Lemma 4 Lei k, T, if and c 6e positive constants. Let 

s(u) = c + z{L- l l 2 + UZ2T- 1 / 2 - ^K. 

Then sup 0<u<f , T _i/2 s(u) > max{c, s(0), s(kT -1 / 2 )} if and only if k > z 2 /K > and z\ > 
-z 2 /{2KT 1 / 2 ). 

Proof. Since the quadratic s has a unique maximum at u = z 2 /{KT l l 2 ), it follows that if 
k > z 2 /K > 0, then 



0<«< K T-V2 vaj-/-/ j-'" 2KT 

and Lemma 4 easily holds. □ 
Lemma 5 Assume (Al)-(A2). Then for any k > 0, t > and c> ji, 
(43) P w { sup S^^max^St.^r-i/a)} ~/eT- 1 / 2 C w e" T * w(c) a.s., 

u ,/ iere Cw = (27r)- 1 (T'wAw) 1/2 . 
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Proof. Assume without loss of generality t = 0. Let Hi (= i/j )W ) be the set of all v such 
that a second derivative does not exists at g$) (v). Note that by (A1)-(A2), the number of 
elements in Hi is 0(T) a.s. for all i. Let < u < kT^ 1 ! 2 and let y G y^ be such that 
y — h ^ (0,u) for all h £ Hi. Then by (A2) and the mean value theorem, 



(44) 



9w (y-u)- g$ (y) + u-^-g^) (v] 

dv 



v=y 



2 dv 2 



for some y — u < ^ < y. 

If y G is such that y - he (0,u) for some he Hi\ {0, T}, then 



ffw (y - «) - (y) + u^-g§ (v) 



dv' 



v=y 



v v=y az, 



(45) 



/ ( 

(h + u-y)(±g$(v) 



d_ 

vih d^ 



fh )+o(u 2 ). 



Finally, for completeness, we consider y G such that either y — T G (0, u) or y G (0,u). 
Then we write formally 



(46) {y-u)- $ (y) +u±g$ (v) = $ (y-u)- $ (y) + u±g$ (v) 



dv' 



v=y 



dv' 



v=y 



By adding up (44)-(46) over all y G yW for i = 1, • ■ ■ , d and dividing by T, we obtain 



(47) 



S u — So — u—S v 
dv 



C-w tU u 



v=0 



where C w ,u is an expression derived from the right hand sides of (44)-(46). It shall be shown 
in Appendix B that 



(48) 



lim sup u T 



T ^ oo 0<«<kT- 1 /2 

Then by Lemma 3, the change of variables 



a.s. under Pg w - 



d 



Zl = T^ 2 (S - c) and z 2 = T 1 / 2 —S x 

dx 

substituting K = 6> w t w into Lemma 4, (47), (23) and (29), 
P w { sup S u > max(c,5 ,5 rer -i/ 2 ) [ 

= Eg 



x=0 



0<m<kT- 1 /' 
r dP 

( \i 

iyj 1 {sup o<ii<KT _ 1/2 S u >m&x(c,S ,S kJ ,_ 1/2 )} 



ldP e 

e -T4>™{c) Efj 
-T0 w (c) 

w% POO 



pTe^c-So), 
c A {sup 



-i/2 •S , n> max (c,So,S KT _ 1(/2 )} 



< • ' "(277) '(tVTw) 



-1/2 



X 



^/(20 w r w Ti/2) 



e -TV^«-^/K^/2T W(fei ^ 
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~ e -^w(c) (27r) -i (UwTw) -i/2 



w ' w 



X / e -S/2r w( _ T -l/2 0w l e -TV 2ew2l) 




2l=00 



d,Z2, 

2 i=- 2 2/(29 w r w T 1 /2) 



and indeed Lemma 5 holds. □ 
The proof of the next lemma will also be shown in Appendix B. 

Lemma 6 Assume (Al)-(A2). Let 

A t = I sup S u > max(c, S t , S t+KT - 1/2 ) \. 
L <<«<t+ K T- 1 /2 > 

Then there exists r K = o(k) as n — > oo such that for all t > 0, with probability 1, 

(49) £ P w (A t n A t+eKT . 1/2 ) < r K T^e~ T ^ 

1<£<T 3 /2/ k +1 

for all large T . 

PROOF of Proposition 5. By stationarity, we may assume without loss of generality 
t = 0. By Lemmas 2, 5, 6 and the inequalities 

LA/(kT- 1 /2)J_i lA/(KT- 1 / 2 )\-l-q 

g=0 £=1 

< P w I sup 5 U > C }• 

L 0<m<A ' 

LA/( K T-V2)j LA/(«T-V2)j+i 

< ^ Pw(^ 9K T-l/2) + X] P w{5 9kT -i/2 > C}, 
g=0 9=0 

it follows that for any < e < 1, there exists k arbitrarily large such that 
(LA/(kT- 1 /2)J _ i) [(i _ £ ) K T- 1 / 2 C w e- T ^ - r K T- l / 2 e~ T ^ 

< P w \ sup S u >c\ 
L 0<«<A ' 

(50) < (LA/(kT- 1 / 2 )J + 1)(1 + e) Lr-^Cwe-^W + (2vr« w )- 1 / 2 w 1 T- 1 / 2 e - T ^ (c) 

holds for all large T with probability 1. Select k large enough such that (50) and the 
inequalities 

r K < e«Cw, (2vrw w ) _1/2 w 1 < ek( w , 
holds for all large T with probability 1. Then by (50), the inequality 



(51) 



P w {sup 0<u < A S u >c}_^ 



A( w e~ T ^( c ) 



< 3e 



holds for all large T with probability 1 and (27) follows from (51) by selecting e > 
arbitrarily small. □ 
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Proof of Theorem 5. Let z e R and let £ (= £ w ) be such that £/T — > oo and 
k (= k w ) : — z ^r<i>vi(c) /£ w £ ig a positive integer tending to infinity almost surely. Define 
= {sup (i _ 1)? <i <i? „ T S' t > c} and C,- = {sup i5 _ T < t < i5 5< > c}. Then 

fc k k 

(52) P w ( |J < P w {Cwe- T ^ (c V c < z} < P w ( (J Bj) + £ P w (C j ). 

Conditioned on w known, the event Bj depends only on the spike train times of y W lying 
inside Since these intervals are disjoint for different j, it follows that B±, ■ ■ ■ , B^ 

are independent conditioned on w. By Lemmas 5 and 6, it follows that with probability 1, 

(53) P w {Bj) ~ (£ - T)C w e~ T<Mc) ~ z/k, P w (Cj) ~ TC w e" T0w(c) for all 1 < j < k. 

Since k — > oo a.s. as T — > oo, with probability 1, 

fc fc 

(54) P w ( (J Bj) = 1 - P~{Bf) = 1 - (1 - ^A) fc + - 1 - 6-. 

3=1 3=1 

Moreover because £/T — > oo, it follows from (53) that with probability 1, 

k 

(55) E P w(C,) ~ Tfc( w e-^ (c) = o(l). 

3=1 

Theorem 5(a) follows from (52), (54) and (55). 

To show Theorem 5(b), we use the Taylor expansion 



/ Z + l0gCw\ , , v . a /Z + l0gCw\ . ^^-2x 

</>w(^C w + — — — J = (Mc w ) + #w(^ — — )+0(T ) 

(56) " = r- 1 [z + io g (aCw)] + o(r- 2 ). 

By the computations in (52)- (55), it follows that with probability 1, 

P w {M a > c w + * + g lQ |5 W } = ^w{V Cw+ [ z+ i ogCw ]/( ewT ) < a} 

(57) W = l-exp[-aCwe- T ^ (Cw+[2+l0gCwl/(ewT)) + (l)", 



and Theorem 5(b) follows by substituting (56) into (57). 

It remains to show (c). Let U a = Yl\=P ^-B y Then by (53), E w \U a } — — > a.s. and 
(28) holds with U a replaced by U a , since Bj, 1 < j < \_a/£\ are independent events and the 
limit of sum of binomial random variables is Poisson. By (53) and (55), 

L«/€J La/CJ-i 
(58) P w(Cj) = o(l) and ^ P w (-Bj n = o(l) a.s. as T ^ oo. 
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Moreover, by Lemma 6, with probability 1, 

[a/{KT- 1 / 2 )\ |T 3/2 /«+lJ 

(59) E | E ^(V'/^V)^ 1 ' 2 ) 

q=0 e=[(l-a)T s / 2 / K \ 



< rj w r K 



for all large T, where aT is the maximal permitted overlap between two matches. By (58) 
and (59) with k arbitrarily large, we can conclude that U a — U a — ► in probability and 
hence (28) holds. □ 

6 Template matching with kernels containing discontinuities 

In this section, we obtain analogues of Proposition 5 and Theorem 5 when the score function 
/ contains discontinuities. A typical example is the box kernel 



(60) f(x) 



1 if x < e, 

—(3 if x > e, 



where (3,e are positive real numbers. Instead of (A2), we have the following regularity 
condition on /. 

(A2)' Let / be a discontinuous function and let there be a finite set H such that the first 
derivative of / exists and is uniformly continuous over any interval within M. + \H. Moreover, 
(22) holds. 

Under (A2)', the values of / may be concentrated on 0, ±q, ±2q, ■ ■ ■ for some q > 0. 
Definition. Let L(f) = {/(x) : x > 0} be the range of /. We say that / is arithmetic 

if 

(61) L(f) C q7L for some q > 0. 

Moreover, if q is the largest number satisfying (61), then we say that / is arithmetic with 
span q. If (61) is not satisfied for all q > 0, we say that / is nonarithmetic. 

For example, if j3 in (60) is irrational, then / is nonarithmetic while if [5 = s/r for 
co-primes r and s, then / is arithmetic with span q = r~ l . We write for i = 1, ■ ■ ■ , d, 

gw(u+) = limg!$(v), 

viu 

k{u) = gw(u-) - s$(u+), 
A = {ue (0,T) :Si(u) /0}, 

where g$ is as in (21). 

Let 4> w , W , f w and fi be as in Section 5.1. If Di / for some i, we can define to be 
the probability mass function taking values in {Si(u) jueA, i<i<d with 

(62) hi( X ) = E ^ E ^ >w iw.w /E Ai E e ^ )(u_) - 

i=l u£Di i=l u£Di 
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Let E* denotes expectation when X\, X2, ■ ■ ■ are independent identically distributed random 
variables with probability mass function h^. Define 

(63) u b = inf{n : X x + V X n > b} and R b = X 1 + ■ ■ ■ + X^. 

Then the overshoot constant 

(64) i/ w := lim E*e- e ™ (Rb ~ b \ 

where b is taken to be a multiple of \ if is arithmetic with span x- Note that the state- 
ment ll h^ is arithmetic with span implies that {d~i(u)} U £E>i,i<i<d C x^- The constants 
f w have been well-studied in sequential analysis, see for example Siegmund (1985) for the 
existence of the limits in (64). Let us define the asymptotic constant C w ( n °t depending on 
y) by 

d U) 

(65) C = (27rTv vv y 1 / 2 u vv K w J> ^ Hu)e e " 9 " ^\ 
where 

1 if is nonarithmetic, 

g^(l — e~ SwX ) if h%, is arithmetic with span / is nonarithmetic, 
x ( i^e-^wg ) ^ Kv i s arithmetic with span %, f is arithmetic with span q. 



K w 



Since we can express each 5i(u), u 6 Di in the form g\ — g<i for 51,52 £ L(f), it follows that 
if / is arithmetic, then is arithmetic and is a positive integer. Analogous to Lemma 
5 and Theorem 5, we have the following asymptotic results. 

Proposition 6 Assume (Al), (A 2/ and let A > 0, t > 0. If f is nonarithmetic and c > fi, 
then 

(66) P w { sup S u > c) ~ ACwe" T0w(c) a.s. as T^ 00. 

t<u<t+A > 

If f is arithmetic with span q, then (66) also holds under the convention that 

(67) Tc (= Tct) £ q% with c — ► d as T — > 00 /or some c' > /i. 

Theorem 6 Assume (Al) and(A2)' and let f be nonarithmetic. 

(a) Let c > /i. Then the distribution (conditional on w) of Cw e_T ^ w ^ c ^^c converges to 
the exponential distribution with mean 1 almost surely as T — > 00. 

(b) Lei a — ► 00 as T — > 00 swc/t i/iai (log a)/T converges to a positive constant. Let 
c w > A*w satisfy </> w (c w ) = (loga)/T. T/ien /or any zGl, 

P w {0 w T(M a — Cw) — log Cw > z l ~~ * 1 ~~ exp(— e~ 2 ) a.s. as T — > 00. 

(c) Zei a — ► 00 as T — > 00 suc/i t/iat (log a)/T converges to a positive constant. Let c 
(= ct) be such that ry w := aC w e~ T< ^ w ^ converges to a constant rj > almost surely. Then 

k 

(68) P w {C/ a = fc} - e -^Ik _> a.s. V fc = 0,l,---. 

fc! 

If f is arithmetic with span q, then (a) and (c) also hold under the convention (67). 
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Table 2. Estimates of P VJ {M a > c}± standard error with a + T = 20s. 



c 


Direct MC 


Imp. Sampling 


Anal. Approx. (69) 


0.065 


0.029±0.004 


0.0300±0.0016 


0.0289 


0.066 


0.019±0.003 


0.0218±0.0012 


0.0207 


0.067 


0.012±0.002 


0.0140±0.0008 


0.0144 


0.068 


0.008±0.002 


0.0103±0.0006 


0.0101 


0.069 


0.005±0.002 


0.0067±0.0004 


0.0070 


0.070 


0.003±0.001 


0.0051±0.0003 


0.0047 



Example 2. We shall conduct here a simulation study similar to Example 1. The 
generation of w and y are similar to Example 1 but the box kernel (60) is used instead of 
the Hamming window function (31) when computing g$) ■ We chose parameters e = 4ms 
and (3 = 0.3 in (60). Hence / is arithmetic with span q = 0.1 and h%, is arithmetic with 
span x = 1-3. In fact, is positive only on the values —1.3 and 1.3 and hence z/ w = 1. 
In the template w that was generated, there were a total of 2 x 59 elements in \J i Di with 
half of all u € A satisfying 5i(u) = 1.3 and the other half satisfying 5i(u) = —1.3 for each 
i. Hence 

/4(-1.3) = e -o.30 w /( e e w + e -o.se w) and ^(1.3) = e *w/( e *w + e -o.30 w)> 

These information are then used in the computation of the constant £w in the analytical 
approximation 

(69) P w {M a >c} = P W {V C < a} = 1 - exp(-aCe- T< M c )), 

an analogue of (33) that follows from Theorem 6(a). 

In Table 2, we compare the analytical approximation (69) against both direct Monte 
Carlo and importance sampling with 2000 simulation runs being used to obtain each entry. 
The variance reduction when importance sampling is used is similar to that seen in Exam- 
ple 1, and the technique is indeed an effective time saving device for computing p- values 
especially when the p-values are small. The analytic approximations are also accurate and 
agree with the simulation results that were obtained. 

In addition to the above simulation study, we also conducted a similar exercise to check 
the accuracy of the Poisson approximation of U a in (68) , this time with a + T = 200s and 
threshold level c = 0.0614. The maximal proportion of overlap between two matches is 
chosen to be a = 0.8. The analytical approximations are compared against 2000 direct 
Monte Carlo simulation runs and the results are recorded in Table 3. Again we see that the 
analytical approximations are quite accurate and this indicates the usefulness of using the 
asymptotic results in Theorem 6 to estimate p-values. 

We shall now prove Proposition 6 and Theorem 6 via the following preliminary lemmas. 

Let 

(70) M*) = X> E e^^h {Si{u)=x} /X>£ e^(«+). 

i=l Ji££)j i=l u(^Di 

Then /i w and [see (62)] are conjugate probability mass functions in the following sense. 
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Table 3. Estimates of Pw{U a = k] and r/ w = £7 w (£/ a ). Standard errors in parentheses. 



k 





1 


2 


3 


4 


5 


> 6 




Approx. (68) 


0.336 


0.366 


0.200 


0.073 


0.020 


0.004 


0.001 


1.09 


Direct Monte 


0.328 


0.363 


0.195 


0.084 


0.024 


0.005 


0.001 


1.13 


Carlo 


(0.011) 


(0.011) 


(0.009) 


(0.006) 


(0.003) 


(0.002) 


(0.001) 


(0.02) 



Lemma 7 Let Di^ty for some i. Then there exists 7 W = 1 + 0(T r ) a.s. such that 



h* w (x) =j w e y ™ x h w (x) Vx. 

(i) (i) (i) (i) I '\ 

Proof. Let u £ Di with w- < u < w-^ for adjacent spikes w- ^w-^ € w w . Then 



by the symmetry of g$) in the interval (Wj ,> , wj+i) about its mid-point {w K - L> + w^ 1 )/2, it 

= g$(u+). Hence 7 W , which 



,(<) 



follows that v :- 



i+i 



(y — ^4^) ^ an( i that 5w ; (v 



(0, 



we define here to be the ratio of the denominators on the right-hand sides of (62) and (70), 
is 1 + 0(T _1 ) a.s. with the 0(T~ l ) coming from u £ Di occuring before the first spike or 
after the last spike in wW . Since 



Lemma 7 holds. 



□ 



Lemma 8 Assume (Al), (A2j and let k > 0. Then for all e > 0, there exists k large 
enough such that for any for any t > 0, the inequality 



P w {S t < c,sup t<u < t+KT -i S u > c} 



- 1 



holds for all large T with probability 1, where the constant is defined in (65) and c > /x 
if f is nonarithmetic; c satisfies (67) if f is arithmetic with span q. 

PROOF. Assume without loss of generality t = and let Gi = {j ve oS v,v K '^~ 1 ]- We can 
write TSq = TS' + Jo, where 



S 'o=T- l E E ^ ) (2/)andJ = ^ £ <£>(y). 



i=l j,ey(0,j,gGi 



1=1 yey (i) nG . 



The random variables S' Q and Jo are independent because they are functions of the Poisson 
processes y W over disjoint subsets of the real line. Let us first consider / arithmetic with 
span q. Then TS' and Jo are both integral multiples of q. Since / is constant between 
jumps, we can express TS U = TS' + J u [see (20)] where 



(71) 



i=l yey {i)^Q. 
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Hence both Sq < c and sup 0<M<KT -i S u > c occurs if and only if 

(72) sup (J u — Jo) > iq and S' Q = c — T~ l kq for some integer £ > 1 and k = Jo/q + £■ 

0<u<kT- 1 

Since E0 w [S' o ] = c + 0(T _1 ) a.s., by the local limit theorem for lattice random variables 
[see, for example, Theorem 15.5.3 of Feller (1971)], 



(73) Pe v {S' = c-T- 1 qk}^ 



(2t:Tv w ) 1 / 2 



a.s. 



for any integer k. Since S' and (Ju)o<u<kT~ 1 are independent, it follows from (72), (73), 
(23) and the change of measure (29) that 



P w \ S < c, sup S u > c\ 



0<u<kT~ 

= Eq ™ [^^(y)l{5 <c,sup 0<u < KT _ 1 S u >c}_ 

oo 

(74) ~ g , 1/2 e- T ^Y,^ q PeJ sup (J u -J )>£q\. 

(2vrTi; w ) 1 /^ ^ 1 <«< K T-i J 

Since <?w is constant between jumps, the graph of (J u — Jo) against u is also piecewise 
constant with jumps at all u for which y — u £ Di for some y G yW, 1 < i < d [see (71)]. 
Let iV* be the total number of spikes in Ui<j<d(y^ n ^i)- Then there are N* such jumps 
and 

sup (J u - J ) = sup (Xi H h Xj) 

0<«<kT-! l<j<N, 

where Xj is the jth jump and has probability mass function /i w . Moreover, X±, X2, ■ ■ ■ are 
independent conditioned on JV t , a Poisson random variable independent of the JQ's with 
mean 

d 

(75) EN, = kT' 1 Ai e 9w9 ™ {M+) . 

i=l ueDi 

If r e {0, ■ ■ ■ , x/g — 1} and s G Z + , then R sx -rq = Rs\ [see (63)]. Let i£* and P* denote the 
expectation and probability measure respectively when X±,X2, - ■ ■ are independent iden- 
tically distributed with probability mass function h^- Then it follows from a change of 
measure to P* and Lemma 7 that 



Ww^p r sup ( Ju -j )>e q \ 

e=1 1 0<u<kT-! > 



00 



7- 1 [ e -^ ( ^-^ ) l { su Pl<J<JV ,(x 1+ ... + x J )>M 
00 

7w a *[ 2^ 1 {suPi< J <iV»(^i+-+X j )>sx} 
r=0 s=l 
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(76) 



x/<?-i 

r=0 



sup (Xi + ---+Xj) 

l<j<N* 



Since Xi has positive mean under for all large T and the almost sure limit of E„ N [see 
(75)] is proportional to k, it follows that there exists k large enough such that 



(77) 



E* [sup 



l<j<N. 



r (X 1 + --- + X j 



(EN^E.X^ 



- 1 



< 



for all large T. Since (q/x) Er=o 1 eT e ™ rq = i^w, Lemma 8 then follows from (62) and (74) 
to (77). When / is nonarithmetic, the local limit result (40) with 22,T = ^ and t = is 
used in place of (73). □. 

The next lemma, needed for the proofs of both Proposition 6 and Theorem 6, will be 
proved in Appendix B. 



Lemma 9 Assume (Al) and (A2)' . Let 



A t = I S t < c, sup 

L t<u<t+nT- 



i ^ M >c}. 



Then there exists r K = o(k) as k — ► oo such that for all t > 0, with probability 1, 



T0 w (c) 



for all large T. 

Proof of Proposition 6. By stationarity, we may assume without loss of generality 
t = 0. Then (66) follows from Lemmas 2, 8, 9 and the inequalities 

|_A/(kT _1 )J-1 LA/^T-^j-l-g 

P v( A qKT-l)- ^2 P v( A qKT-l n A (q+e) K T-l) 

q=0 1=1 
LA/(«T-i)j 

< P w { sup S u > c) < V P w (A 9kT -i) + P W {S > c}, 

U<«<A ^ 



with k arbitrarily large; see for example, the proof of Proposition 5. □ 
Proof of Theorem 6. The proof of Theorem 6 proceeds as in the proof of Theorem 
5. The only modification needed is the replacement of £ w by £w- For the proof of Theorem 
6(c), we will also need to replace (59) by 



[a/(KT-i)} [T 2 /k+1\ 
q=0 £=[(l-a)T 2 /K\ 



< 



T l/2 



□ 
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8 Appendix A 

Lemma 10 Let 0? „ and p ft 2 be as in Section 2. Then for each (si,ri) G 

K,q,n i u S ] Il n,q,n> 

CO 

{ Yl / sup [PsSi({^i>---,^}) 

k =0 ^0<«;i<-< TO:/ <T p e2 ((si,r 1 ),( S2 ,r 2 ))< £ ,( S2 ,r 2 )e0? „ 

K,q,n ' ' 

1/2 



-rf 2 2 r 2 ({^i> • • • , Wj})] 2 dwi • • • dm*} < eC k , 



where C k > 1/2 is a constant depending only on k. Here f* denotes the upper integral [see 
for example Dudley (1999), page 94]- Consequently, 

(78) H B (e,F~ K , q , n , P ^ q J < H(—,el q ^ P@l J < jrjt^ 1 - 

PROOF. Let R = n V 1. We observe from (2) that 

\plun ({^l. ■■■,v>j})- Pli%t({m, Wj})\ 

i=i 

_ e -/„ T S2 (*)- 2 (*--CW)^/2]j4/ 2 ( Wi ) r V 2 ( Wi _ 

i=l 

where = max{A; > : wt < t}. Since /j@2 (s2j?"2)) < e, we have 

\s{ /2 (w^rl /2 (wi - Wi-i) - sl /2 (wi)rl /2 (wi - u>i-i)\ < 2sR, Vi > 1, 



and 



| e -Jo si(t)ri(t-tu c(t ))rft/2 _ g -/ s 2 (t)r 2 (t— iu c(t ))dt/2| 
< |1 _ e So[ s ±( t ) r i{ t - w at))-S2{t)r 2 {t-w at) )}dt/2^ 

3rr\i 



(2eR s T) 
< 2eR 3 T}±-. — 

Consequently for j = 1, 2, ■ • 



/ 

JO 



SUp [ e -/ ( r s i(*) r l(*- tl 'C(t)) d '/2 



0<wi<-<Wj<T p e2 ((si,ri),(s 2 ,r2))<e,(s2,r 2 )6e| 

K,q,n 
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3 

s] 12 {wi)r\ /2 {it)i - Wi-i) 



X 

1=1 



e - J T «2(t)r2(t-to f(t) )dt/2 -Q s l/2^.^1/2^ _ u ,._ l) ]2 du , i . . . ^ 
i=l 



i=0 
2 



and for each (si,n) G e^Je), 



{E / SU P IplunO^ir-^Wj}) 

( j=0 J0<w 1 <-<w j <T p e? ((si,ri),(s2,r 2 ))<e,(s2,r 2 )6e? ]9i „ 



■Paa.raCW. ' ' ' > w j»] du, l ' ' ' dw 3 \ 

(z + 1)! j! 



j ii i=0 

< eC k , 



where C k is a absolute constant depending only on k. (78) now follows from Lemma 2.1 of 
Ossiander (1987) and (7) since C k > 1/2. □ 

Lemma 11 W^i/t i/ie notation of Section 2, we have 

H B {e,Z k ^ n ,p2 Rqn ) < H B ( 2 , Fa^n, pr k , q , n ) 

< 2 («+ 2 )/«c3 /9 C*, ? (^) 1 /«, Ve>0. 
PROOF. For % = 1,2, let fi : N —>■ R be a nonnegative function such that 

oo „ 

E / fi({wi, ■ ■ ■ ,Wj})dwi ■ ■ -dwj < oo. 

j=0 J0<wi<-<Wj<T 

Define for j = 0, 1, ■ ■ ■ , 

= {{wi, ■••,«>?} : fi({wi, ■ ■ ■ ,wj}) < e~ T p 3j r({wi,- ■ ■ ,wj})} , 

and 

U J " I e- T p a>r ({«;i,---,«;j}), on A hr 

Letting Zj i as in (8), we have 

oo „ 

E / -^({wi,-",^-})] 2 

j=0 ■'0<tui<-<tu J -<T 
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xp s ,r({wi, • • • ,Wj})d,Wi ■ --dWj 



lo g( ) _ log( /2 i2 ({Wl '---'^ }) )l 2 

Pl!r({w!, ■■■,W j }) p\{r{{ w li- ' ' , w j}) 



- 4 E/ 

j=0 J 0<u>i<---<Wj< 
x.p s ,r({wi, ■ ■ ■ ,w j })dw 1 ■ ■■dWj 

oo . 

- AeT ^2 [fl /2 ({wir--,Wj})- f^ /2 ({wi,---,w j })] 2 dwi---dw j 

j=0 J0<wi<-<Wj<T 



By dividing the integral into four parts: namely A±j n A^ } ^A\^ n A2j,A±j n and 
A9 „■ PI A% ■ , we observe that 

OO „ 

/ [/i 1/2 ({^ir--,^j}) -/ 2 1/2 ({wi,---,^j})] 2 ^i---^j 

j=0 J0<vii<-<Wj<T 
oo . 

< / [/i 1/2 ({wir--,Wj})-/2 1/2 ({*wi>"-,«'j})] 2 d«;i---d«;j. 

j=0 J 0<wi<—<w j<T 

Now we conclude that 

oo „ 

{E / [%({^ir--,^})-%({^i,---,^})] 2 

j=0 ■'0<tui<-<t«j<T 



1 1/2 

XPs,r({«>l, ■ • ■ ,Wj})dlVi ■ ■ ■ dWjj 



oo „ 

< 2e^\Yl / [fl / \{w 1 ,---,w j })-ti /2 ({w 1 ,---,w j })] 2 dw 1 ---dw j \ 

1 j =Q J0<wi<-<Wj<T ' 



1/2 



Lemma 11 now follows from Lemma 10. □ 
The statement of the next lemma can be found in Wong and Shen (1995), page 346, but 
its proof is not provided there. 

Lemma 12 (A Bernstein-type inequality) Let Z\, Z2, ■ ■ ■ be independent identically dis- 
tributed random variables satisfying 

E{\Z^)< 3 -__L^ V j>2. 

Then 

1 n t 2 
n^T ; ^-EZ,)>t]<eM- M2i + Url/2} ], W>0. 

PROOF. The following proof is an adaption of the proof given in Bennett (1962), pages 36 
to 38. Let c > be a suitably chosen constant and Var(Zj) = a 2 . Then 

E^-ez^ = i + c 2 jr c3 ~ 2E[( -\ EZ * )J] = 1 + c 2 F, say. 

J'=2 J ' 
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Since 1 + c 2 F < e c2p , we have 

n 

E{exp[cY l (Zi-EZ i )]}<e n 



i=l 

Hence it follows from Markov's inequality that 



P[^2(Zi-EZi)>tyM < e - ct VS jBe cE?=i(^-BZ i ) 
i=i 

^ e c 2 nF-ct\/n 
< e -* 2 /(4F) 5 

by choosing c such that -F = t/(2cn 1 / 2 ). Now for j > 2, 
|E(Zi - EZtfl < E(\Z 1 -EZ 1 \ j ) 

< 2 j E(\Z 1 \ j ) 



Hence if 26c < 1, 

This implies that 
and consequently 
By taking 



oo _ 

F< 2^(2^ = ^. 

i=2 



t < 2 7 



2cnV2 - 1 - 26c' 
c > 



4 7n i/2 + 2 6t' 
4 7n i/2 + 2 6t' 



we observe that 26c < 1 and 

1 n t 2 
P ^ g> - **> > t] < exp[- 4(27 + w „_ 1/2) ]. 

This proves Lemma 12. □ 
For 5 : N -> i?, let 

1 n 

(79) i/ n (^) = -j^ J[g(Ki,- • • , w i:N . {T) }) - E Str g{{wi^,- ■ • , w ijN . {T) })], 

n i=i 
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assuming that the right hand side exists and {w^i, • ■ ■ , w^jv^t)} ^ s as i n Section 3. We 
observe from Lemma 6 of Wong and Shen (1995) that 

(80) P s ,rWn(Zf)>t]<eM—, — — ], Vt>0, 

8(8co||/ 1 /2-p s 1 / 2 ||2 + 2 t n -i/2) 

where Zf is as in (8), 

c = (e^-l- T -)/(l-e-^f. 

The next lemma is motivated by Theorem 3 of Shen and Wong (1994) and Lemma 7 of 
Wong and Shen (1995). As the proof of the latter lemma is only briefly sketched in Wong 
and Shen (1995), page 348, a detailed proof of Lemma 13 is given below. 

Lemma 13 For any £>0,0<7<1 and M > 0, let 

M 2 

W t 2 ,n) = i6(8cot2+Mn _ 1/2) . 

Assume that 

(81) H B (^,^ n ,p^ qn ) < l^(M,t 2 ,n), 

fe T/2 t/5 T , M>y 3 / 2 

(82) / [H B (^^, q ,n,P^ q J} 1/2 dx < 

JjM/(32n 1 / 2 ) 2e T / / 2 LU 

Then 

KA i2 i2 ^p >'n(i 1 , ri )>M]<3e-NW< 2 ,») i 

\\Psi ,r\ Ps,r I! 2^t,p 3 ^ ,r\^£3~K,q,n 

where v n (.) is as in (79). 

Proof. Without loss of generality, we can assume that 

(83) 3e -(i-7)V(M,tV) < x 

else Lemma 13 is trivial. We observe from Lemma 11 that 

For any 5 nj0 > 5„,i > • • • > 8 nm > 0, there exist T n j,j = 0, • • • , n , with 



\F n j\ =exp[H (^J5,Fk,q,n,PF~ K , q , n )}, 



such that for each p Sl , ri G ^k,q,n one can find fj', /j 7 G such that 

//({">!, ■■■,™/V(T)}) <Pai,ri({^ir--,^JV(T)}) < " " " , ™JV(T)}), S., 
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and 



oo „ 

{ / [Z f v({wi,--- ,w h })- Z fL ({w!,--- ,w h })] 2 

K j 1= Q J Q<wi<-<w n <T 3 3 

1 1/2 

xp s , r ({wi, • • • , w h })dwi ■ ■ ■ dw jl I 

oo „ 

< / [fV({ Wl ,..., Wn }) 1/2 

j 1= J0<vj 1 <-<w jl <T 

1 /9 

-//■({wi,---,^}) 1 / 2 ] 2 ^---^} 



< S n 



,3- 



Here no is a nonnegative integer to be suitably chosen later. Define for k = 0, ■ ■ ■ , no, 
«fc({wi,---,«'JV(T)}) = mm^f^dw!,--- ,w N( r)}), 
l k {{w l ,--- ,w N(T) }) = max //(W,--- ,^iv(T)})- 

Then Z^ fe < Z p , iiri < Z Ufc , < Z Mfc+1 - Z lk+1 < Z Uk - Z lk a. s., and 

[E S:T (Z Uk - 4) 2 ] 1/2 < [E s , r {Z f v - ZfL) 2 } 1 / 2 < 8 n>k , Vk = 0, • • • ,n . 

J k J k 

If no = 0, define B$ = N. If no > 1, let a\ > a?, > ■ ■ ■ > be a sequence of constants 
and define 

Bo = {Z uo - Zi > a±}, 

B k = {Z Uk - Z h > a k+ i,Z Uj - Z tj < dj+i, j = 0, ■ ■ ■ , k - 1}, Vfc = 1, • • • , n - 1, 

B n = (Ufcl ljB fe) C - 

Note that {B k : k = 0, • • ■ , no} forms a partition of jV. Consequently writing ls k to denote 
the indicator function of B k , we have 

no no 

fc=0 fc=0 

no no 

= Z Uo + ^(^« 3 - Z Uj _ 1 )l u . ) < fe < no B, c + ^(2p si , ri - Z Uk )l Bk - 

j=l k=0 

Let 771, ■ ■ ■ , ijno+i be strictly positive constants such that 2ni + • • • + 2ry no + ry no+ i < 7M/8. 
Then 



^[ sup ^(4 liri )>M] 

I, 1/2 1/2 1| ^- 

I IPs 1,7*1 Ps,r || 2 _^^Psi ,7*1 k,q,n 

< KA s ^P ^ n (Z U0 )>M-^] 

H 1/2 _ 1/2,, 4 
||Ps^,rj Ps,r || 2_^Psi ,7*1 ^^~k,q,n 
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no n 
+ p sA i2 i2 su p M^(Zu j -Zu i -i) 1 U j < k <^ ) B k )>'52vj] 

\\pl{% 1 -pl(r\\2<t,Ps l: r 1 ^ii,q,n j=l j=l 

\\pV^r 1 -pV,r\\2<t,Ps 1 ,r 1 &J r k,q,n fc=0 ' k = l 

< \F n ,o\ 2 2 SUp ^ s >n(^) > M — ] 

IIPsj ,7"-^ Ps,r l|2f^ >Ps;L,rj k,q,n 

no 

+ ^Z P tA SUP ^n((^ - ^.Jlu^^Bj > Vj] 

r - : I, 1/2 i/2n 

J=l IPsi.T! -Ps,r ||2<t,Ps 1 ,r i e./ r R, g ,n 

no - 1 



1/2 1 /2 

fc=0 llPsi.ri-Ps'r l^^tjPsLriE^S.a.n 



+P* r [ sup v n {{Z p - Z u )1 B ) > — + r/„ 0+1 ] 

I, 1/2 _ 1/2,, O 
IIPs^,r^ Ps,r || 2 J^iPsi C-/~ k,q,n 

< |-F„,o| SUp P s , r K(Z U() ) >M--^-] 

IIPsj — Ps,r || 2 ^t>Ps± ,ri ^^~k,,q,n 

n 3 

sup P Sjr [v n ((Z Uj — Z Uj _ 1 )l\j :j < k < no B k 

j = l 1=0 \\pl{\-pl{r h<t,Ps l ,r i eJ r k,q,n 

n -l 

+ P sA SU P ^n((^p Sl , ri - Z Uk )l Bk ) > T] k+1 ] 

1 1/2 1/2 - 

fc=0 llPsi.ri-Psfr Ih^tjPsLrie^s.a.n 

(84) +P* r [ sup z/ n ((Z Psi , ri -Z u )1 B )>i_ + 7/no+1 ]. 

I, 1/2 _ 1/2,, O 
IIPs^,r^ Ps,r \\2^t>Ps-L,r-L fc-' k,q,n 

Since H B may be replaced by a larger continuous function at the expense of an arbitrarily 
small increase in these values, we shall follow Alexander (1984), page 1045, and assume for 
the rest of this proof that H B {.,J 7 k ^ q ^ n} p : p kqn ) is continuous and strictly decreasing from 
oo to on (0, a] for some a. We define 

5 nfi =inf{z : H B (^,T k>q , n ,p^ q J < ^(M,t 2 ,n)}. 
Case 1. Suppose that <5„ )0 > 7M/(8n 1 / 2 ). Then we further define 

*H B ( 6 -^,F~ K , q , n , P^, q J}}, Vj = 1, 2, • • • , 
n = mm{j : 5 nJ = ^-^}, 



= 4 <W-i[ — ] 7 , Vj = l,---,n + 1, 

7 
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Vj 

Thus S n>no = 5 n>no+1 = 7 M/(8n 1 / 2 ), 



2e r / 2 

2e r / 2 ~ 10' 

We observe from Lemma 3.1 of Alexander (1984) and (82) that 
i=i ' j=l i=0 



g no — 1 



r 4 „ R/ <5 r 

no+l 



1/2 5^ ^n,j-l[4-ff ( y 2 j-F/e^nj P^s,,^)] ^ 



7 l/2 "nj-n™ V 2eT/2 



J =710 



< 



rJVll ^ $n,j[H B ( 'i/ 2 ^H,q,n, PFr^u)] 1 ^ 
3=0 Z€ 

7 ' y 7 M/(32nV2) 2e r / z 

< 2^ 

- 8 

Now we observe from (80) that 

|^n,o| i2 i 2 sup P Sir K(Z U0 ) > (1 - 2- 2 7 )M] 

IIPsj — Ps,r \\2<tiPsi ,rj €LJ~fc,q,n 

e H»V n , e-T/>/2Sz, q ,n,P*- Ktq , n ) exD r (1 - 2- 2 7 ) 2 M 2 

P1 8[8c (2- 1 e— / 2 <5 n , + t) 2 + 2n"V2(i - 2- 2 7 )M] 
if^oe-/^,^,^^) r (l-2- 2 7 ) 2 M 2 

P1 16[8c t 2 + n- 1 /2(i_2-2 7 ) M ] ; 

< exp{[J-(l- J)^(M,t 2 ,n)} 

< exp [_(i_^ )V) (M,t 2 ,n)], 



since 



\U /2 ~ Pl!r% < ll^-^Jb + ll^-^b 

~ 2e-/ 2 + 

< — . 

~ 10 
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Next we have 

Vai 8 , r [(Z Uj - Z^^JIu^^^bJ < ^ s ,r[(^_i - z lj-i) 2 } 

< S l,j-n Vj = l,---,n , 

and — < — -^Mj-i < -^Mj — ^rij-i < on ^j<k<n Bk- Hence by the one sided version 
of Bernstein's inequality [see Bennett (1962), page 38], 

PsAMiZuj-Zu^lu^^B,) > Vj) < exp [~ 2 (^ . i + a ^./(3 n i/2)) ]' V -? = lr--,n . 
Since 

2(^_ 1 + a j r ? ,/(3nV2)) 22^ " 7 ^ 1 2e^ ^ J ' 

we have 

n j 

5^(11 l"^.'!) SUP Ps,r[ u n{{Z Uj -^--Jlu^fc^Bfc) > 

j=i 7 i=o /e 

2(1-7)^„b, ^ 



+ exp[ —1- H B (-^,T k ^ n , pr- Ktq> J] 

7 i=o e 

r ^ 2(1- 7 )(EL 4Q b 5„, , 

< 2^ exp ^ z H y^F/2^^,p^n, q ,J\ 

3=1 7 

' +gp [-('-^(*-'+£e.-'*') W f,„)i. 



"2 

Thus it follows from (83) that 



XHI l^'D t 2 x 2 SUP PsA u n({Z Ui - ^-i)lu 3 < fe <„ B fc ) > Vj] 

< 2exp[-(l -7)^(M,t 2 ,n)]. 



SPIKE TRAIN MODELS 



44 



Next using Markov's inequality, we observe that for < k < no — 1, 

P (U \ <" P (7 7 -> „ W E s ,r{Z Uk — Z>i k ) 2 5 nk 

^s,r{£>k) S r's,r(^u k - ^l k > Ofc+lJ S g - 

a k+l a k+l 



Hence 



i 2 SUP ^n((Zp sl , n - Z Ufc )l Bfc ) 

IIPsx ,ri ~Ps,r W'lH^tiPs-L ,r\ ^^k,q,n 

< n 1 / 2 ^ sup E Sir .[(Z Ufc -4)l Bfc ] 

< n 1 ' 2 sup [E s , r (Zu k -Z lk ) 2 P. %r (B k )} 1 / 2 



< 



n 



1/2,5; 



1/2 1/2,, . 

2 

n,fc 



%+l 



, Vfe = 0, • ■ ■ , no — 1. 



This implies that 



n () -l 

^ P* r [ sup ^((^.n - ^u fc )lB fc ) > %+l] = 0. 



K , q , n 



Finally, 



SUP Vn{{Z psin - Z Uno )l Bno ] 

sup n 1/2 ^,rl4 liri - Zu no \ 



and consequently 

SUp "n((^p. liri " Z Uno )l BnQ ) > V + = - 

I, 1/2 _ 1/2 ,, 11 o 

Thus we conclude from (84) that 

Case 2. Suppose that 5„ i0 < 7M/(8n 1 / 2 ). Then define n = and 771 = 7M/I6. As in 
Case 1, we have 



|^n,o| sup P s , r [MZu ) > (1 - j)M] < exp[-(l - -i-)V>(M,tV) 

II 1/2 _ 1/2,1 . ^ 4 4 
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and 

7M 

p sA su p M( z Psi , ri - z u no )iB no ) > — + »Mo+i] = 0. 

I, 1/2 _ 1/2,, , „ O 

This completes the proof of Lemma 13. □ 
PROOF of Proposition 1. Without loss of generality, we can assume that 

(85) ^l-silo) 1 " L 

We observe that (10) holds with e replaced by any s such that e < s < 1. Let 7 = 1/2, 
e T / 2 = 5, i = y/2s and M = 7n 1 / 2 s 2 /2. Then it follows from Lemma 11 that 



r e T / 2 t/5 „ /-v^s , . o„r/2 

/ , [^ B (^,^,n,P^„J] 1/2 dx < / [2^c?i /9 c s>s (^; 

/ 7 M/(32nV2) 2e T > z J s 2/ 2 s X 

< 



n l/2 s 2 



2 13 V2 

M7 3 / 2 

2 10 ' 

and since q > 1/2, 

n^V .2<?- 1. 1 



^2te+2)/5C3 /9 C Si5 (2e-/ 2 ) 1 / g 



< 



2 13 \/2 2 9 / (^/2 s )(2g-l)/(2 (? ) _ ( 2 -3 s 2)(2g-l)/(2g) 



M ^ 2g- 1 1 



2 n V2 2q '( v / 2 s )(29-i)/(2g) _ ( 2 -3 s 2)(2 g -i)/(2 g ) 
Also with tp(M,t 2 ,n) as in Lemma 13, we have 

t_ 2(g+ 2)/ gC l/ gc ^ (2e ,/2 ) l/ g 

M 2 ,2g-l x - 1 



< 



< 



223 S 1/ ? V 2g / [( v ^ s )(2g-l)/(2( / ) _ ( 2 -3 a 2)(29-l)/(29)]2 

M 2 

2 23 s 2 

M 2 



4s 2 (64c + 1) 
^(M,t 2 ,n). 



Consequently, we observe from Lemma 13 that 



P sA SU P v n(Z p ) > — — ] < 3ex P [- 

,,1/2 _ _i/2,| 2<2 o ^. 4 2'(2 D c 
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Let A = {p Sl ,n £ Fk,q,n '■ s 2 < IIPsi^i — Ps^ 2 1 1 1 ^ 2s 2 }. By Lemma 4 of Wong and Shen 
(1995), 

supE(Z Psi . ri )<-(l-5)s 2 , 
A 

where 5 = 2 e - r / 2 (l - e" r / 2 )- 2 = 5/8. Now 

P* jsup 17 > eX pf-n S 2 (l - <5 - -)U 

nV2 s 2 



< P* r {supi/ n (Z Paiiri ) > 



and hence 



p:a sn P fr psurd{m,ir--,w hmT) }) ^ e _ ns2/8} 



< 3exp[- 



2 



2 7 (26 C0 + 1) J 



Let L be the smallest integer such that 2 L e 2 > 4 > maxlHp^^ — Ps^Hi : Psi,n e ^k,q,n}- 
Then 

' ibi^-pi!?i| a >.*. 1 .n^.-ti ^(Ki.-.Vffl}) 

< VC{ sup n ^'" ( r Kl '---'^ (r i" >e-- 2 / 8 } 

,=0 ' 2, £2 <||4 2 ri - P ^||2< 2J+ i £2 , Psi , rie ^, q , nl=1 ^(Kl--,^)}) 

1 r 2'/,: 2 
Hence we conclude from (85) that 



* 3 E ex P[- 27(2 a C0 + 1) ]- 



p; r { sup f[ ^■^Kir--,^ <( T)}) > e _ n£V 8 } 

ne 2 

< 4exp[- 



2 7 (26 C0 + 1) J ' 

This proves Proposition 1. □ 
Lemma 14 Let st t (t) and r$i(t) be as in (11). Then 

Es,r{-^- 1) < Cf- i5 n , 

PJ -t 

where Ca,i is a constant depending only on R. 
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Proof. Let k = kq V 1. Then 

E S A^- - 1) 

Pj _t 

y. /• e~ ^ «WKt-^c(o)^ nil s(wj)r(w t - w^) 



X [e" Jo T -(*)r(t-to C (t))* J| S(^)r(^ - - 
i=l 

i 

e - So sl(t)rttt-w m )dt -Q s t ( w .) r t ( w . _ w^jdw! ■ ■ ■ dWj 
i=l 

< e ^(^+S n )T / |e" /0 S(t)r(t- W<(t) )dt "Q g^.)^. _ _ 

j =0 J0<wi<-<Wj<T i=1 
e - So sUt)rUt~w m )dt "Q s t ( Wi ) r t ( Wi - u,^)!^! . . . d^. 



i=l 

Now we observe that 

|4(^j) r n(^j - w^i-i) - s(wi)r(wi - Wi-i)\ < 2R 2 5 n (2R + 5 n ), 

and 

| e -/o Ts i( t )4(*— !«C(t)) d * — e - So s (*) r (*- w C(«)) d *| 
< |1 _ e S T l s ™(t)rt(t-w c(t) )-s(t)r(t-w at) ) 



i=0 

This implies that for j = 1, 2, ■ ■ ■ , 



/" |e-i'o Ts WKt-«'CW)^pj s ( u ,.) r ( u , i _ u,^) _ 

j 

e - / T 4(t)4(t-i«c( t ))d*pj s t( Wi ) r t( Wi -Wi-iJIdwi ■■■dw j 
i=i 

< 5 n {2^+ 2 (2K + ^rf: ^^^ + 2^- 2 (2k + <5 n )}^. 

i=o ^ + j ' J ' 

This proves Lemma 14. □ 

Lemma 15 Zei N(t),t G [0, T), 6e a counting process with conditional intensity Ai(.|.) as 
in (1). Suppose that (12) holds and 

(86) m-=^\PsAN{t + 5)-N(t) = l], Vt€[0,T). 
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Then for s±,ri € 0«, g , n , 

/ P-,r({«'i,---,«; J -})log[- ]dwi---dw 

~5 JO<wi<-<Wj<T PsunKiWl, , w 3l) 



T F t 



Jo Jo 1 s(t)r(u) 81 s(t)r(u) Jm ; W ^ 
V / p a , r ({wi,---,Wj})log[ Ps - r ^ 1 ' ,Wj ^ - ]dW! ■ ■ ■ dWj < 1, 



then 



and 



E / P,,r({Ml, - , ^}) log[ ' ' ' ' ]d Wl • • • du, 

20 / c ; s(t)e-Jo s ( u ) du dt 200 ./o 



> / Ps^iW!,- ■■ ,Wj}) \0g[ J — T -\dw 1 ---dw 

1 1 1 

> mm{ 1 , } 

20 J Q T f* £(t - u)a(t)r(«)e- si - v)r{ - v ~ t+u)dv dudt 200 



x[ ^ f \si{t)n{u) - s{t)r{u)\i{t - u)e-^ s{v)r(v - t+u)dv dudt} 2 . 
Jo Jo 

Proof. Writing the expectation as a Lebesgue-Stieltjes integral [cf. Aalen (1978) and Karr 
(1987); see also Miller (1985), page 1455, for a different approach], we observe that 



V f iv({wi, • • • ,Wj}) log[ Ps > r ^ u I^IL ]d Wl ---dwj 

PsAi w li - ■ ■ i w N(T)}) , 



E s ,r log[ 



T N(T) 



E s . r {- / s(t)r(t - wm t ))dt + V] log[s(u;j)r(iWj - Wj-i)] 
+ / si(t)ri(t - w N i t ))dt - y~] log^i^On^ - Wj-i)]} 
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= - F s(t)e-ti s{u)du dt+ ( T \og[s(t)]e- ti s ^ du s(t)dt 
Jo Jo 

- [ f s{t)r(u)Z(t - u)e~ siv)riv ~ t+u)dv dudt 
Jo Jo 

+ f f log[s(t)r(u)]£(t - u)e~ £-« s(v)r(v-t+u)dv s ( t y( u ) dudt 
Jo Jo 

rT rT 

+ / Sl {t)e'^ s ^ du dt- / \og[ Sl {t)]e'^ s ^ du s{t)dt 
Jo Jo 

+ f [ Sl (t) n (u)t(t-u)e~^ s{v)r{v - t+u)dv dudt 
Jo Jo 

- f riog[s 1 (t)r 1 (u)]e(t-n)e-^- s(,;)r{,; - i+u) *s(t)r(n)dndt 
Jo Jo 

= f<$- 1 -<$r lw *-*' w ** 

Next suppose that 

V /" P«,r({wi, • • • , Wj» log[ Pa - r ^" ;i ' L^zlL]^...^- < l. 

Since y — 1 — log(y) > for all y E [0, oo) with equality only if y = 1, we conclude that 

(87) f - 1 - log[^]} S (t)e-/o < 1, 

Jo S W sit) 



and 



T /-t 



{ £i^iM _ 1 _ l og[ ^M m _ u)s {t)r{u)e-^^-^dudt < 1. 



Observing that 



y - 1 - log(y) > (l/ 1Q 1)2 l{y G (0, 6)} + > 6}, 



it follows from (87) that 



> 
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> min{ 5_ 1 }[ [ T \ S -M- l\s(t)e-fi'M du dt] 2 . 

In a similar manner, we have 

7 7 1 s (t)r(«) SL a(*)r(«) Jm ' W K } 

1 1 , 

> mm{ - t , } 

20 f Q T f* £(t - u)s(t)r(u)e- s(v)r(v-t+u)dv dudf 200 

'* , S 1 (t)r 1 ( U ) ^ _ u)s{t)r(u)e - j:_ u s(v)r(v-t + u)dv dudt] 2_ 




o Jo s(t)r(u) 



This proves Lemma 15. □ 
Proof of Lemma 1. Following Yatracos (1988), page 1183, we observe that 

sup{£ s , r [ / \s n (t) - s(t)\dt] : s £ @f-, q ,r £ ®e,k,q\ 
Jo 

> sup{E' Sl 

,ri [ / \Sn 

(t) - Sl (t)\dt] : Sl E e h , q , n } 

Jo 

1 l' T 

- T E E wU \Ut) ~ Sl(t)\dt], 

card(e S! g !n ) ^ J 



for any n G ®e,k,q- Define s* G 6jt i9 , n such that 

/ |S n (t)-a£(*)|dt = inf{/ |S„(*)-ai(t)|dt :ai €e«,,,„}. 

JO JO 

Then we have for si G 6™, 

f \s* n (t) - Sl (t)\dt < [ \s* n (t) -s n (t)\dt+ [ \s n (t) - Sl (t)\dt 
Jo Jo Jo 

< 2 f \s n (t) - Sl (t)\dt. 
Jo 

So 

r-T 



card(e Sj(?)n ) 



(*) - Sl (t)\dt] 

~T Jo 



^ o ITTS ^ E ^i.n[/ T |^(*)-aiWI*] 

2card(e Si g in ) ^ J 
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(89) 



> inf{ / \si(t) - s 2 (t)\dt : s x / s 2 , s 1 ,s 2 € &a,q,n} 
Jo 

£ ^i.nOS^l)- 



2card(e S)(?)n ) 



We observe from Fano's lemma [cf. Ibragimov and Has'minskii (1981), pages 323 to 325, or 
Yatracos (1988), page 1182] that 

1 



card(6K )(?)n ) 



siee- K 



> 1 



i { 

R,„J - 11 I 



log[card(0K, g , n ) - 1] 



log 2 + 



(90) 



+- 



1 



[card(G Si(?ir 



> E aun log I I - 

f_l JWiCW,!, ■ ■ ■ , w i,Ni(T)s) 



(15) now follows from (88), (89) and (90). (16) is proved in a similar manner. 



□ 



9 Appendix B 

PROOF of (48). Let u = 5T~ 1 / 2 for some < 5 < k. The contribution to C w ,« from (44) 
is equal to (under Pg w ) 



d 2 



1=1 j/ey(i) 
d cT r j2 



v=y 



(91) 



E y(0 



i=i 



^Sw^V^^efo + o(l) a.s. as T -► oo. 
dv z J 



If j^zg$(v) exists for all v in an interval (vq,vi), then by integration by parts, 



VI 



"0 



dv 2 



(92) 



dv 



e 0w9w i v )dv 

V = V\ 

*> =t >0 J VQ 



Of 



By letting vi j and vq 1 /ij, where /ij and are adjacent points in Hi, it follows 
from (92) that (91) is equal to 



(93) T^VA, V (±$(v) -±$(v) )e^ ) W + e w r w + o(l) 



=i ZieHi 

By the law of large numbers, the contribution to C WjU from (45) is equal to 
d rh f " - ■' d 

v]h dv" " * ' v\hJ 



(<"> ^X>< £ [jf'(*+-»)*](s«S?wU-^w|J^<" + ^ 

i=l heHi n 
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=1 hem 



d (i) 

vlh dv 



- — (iwiv) )e 0wfl -W+o(l) 

v[hJ 



almost surely as T — > oo. Since the contribution from (46) to C WjU is asymptotically- 
negligible, it follows from adding up (93) and (94) that 



(95) 



lim [C w>u - 6> w r w ] -> a.s. under P ew . 

T— >oo 



Since the first and second derivatives of <?w are bounded and continuous by (A2), it follows 
from (47) that there exists j3 s — > as s — > such that 



(96) 



sup T\u 2 C WtU - x 2 C w , x \ < S 



for all large T with probability 1. We can conclude (48) from (95) and (96). □ 
Proof of Lemma 6. By stationarity, we may assume without loss of generality t = 0. 
Let I > 1 and let us denote by Qg (= Qe,e) the probability measure under which yW is 
generated as a Poisson point process on [0, T + inT^ 1 / 2 ) with intensity 

T]i(u) = \i exp[6gw (u)] for all 1 < i < d, 

where g$(u) = g®(u) + g$(u - iKT' 1 / 2 ). Let 



(97) 



w (c) = SUp 
6»>0 



2#c-T" 1 ^A i 



i=i 



and let # w > attain the supremum on the right hand side of (97). Define S x = S x + 
S x+ ^ kT -i/2- It follows from the arguments in (36), (41) and (42) that 



d 

dx 



x=0 



E 6w [S ] = 2c, E g 
where E^ denotes expectation with respect to , and 

Sq \ rp-l ( "W 



0{T- 1 ) 



Cova 



dx x \x=0 



r w 



where t> w and r w are defined as in (26) but with Jq*^ ' replacing J Q T , g$ replacing 
and W replacing 6* w . Hence for intervals I^t, h,T satisfying the conditions of Lemma 3, 

~ (2ir)- 1 (5»fw)- 1/2 ( / e-'VW"1 dzM f e~4'< 2f -> dz 2 ). 

By the arguments in the proof of Lemma 5, it follows that analogous to (43), 

P W (A n A fKT - 1 , 2 ) < P w \ sup S u >max(2c,S ,S KT -i/ 2 )\ 

1 0<m<kT-V2 > 
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(98) ~ kT~ 1 ' 2 ^ 



3 -T0 w (c) 



for all £ and large T, where Cw = (2tt)- 1 (t w /v w ) 1 / 2 . It remains for us to show that there 
exists a constant 7 > such that with probability 1, 

~ d riKT-^l^+T ,., 

<Mc) > 9 w c - T- 1 V Ai / ( e ^w^ ; («)/2 _ x j dn 

i=i ^ 

(99) > 0w(c) +7min{^ 2 K 2 T" 1 ,l} 

for all 1 < £ < T 3 / 2 /k + 1 and T large, so that (49) follows by adding up (98) over 
1 < £ < T 3 / 2 /k + 1. 

The first inequality in (99) follows directly from letting 6 = w /2 in the right hand side 
of (97). By the identity 

2e 0w (u)+gtt («"teT- 1 /2)]/ 2 
it follows from (23) that 

d r ltK,T- 1 / 2 +T 



T i=l ^ 



1 d rlnT^ 1 / 2 +T 

(100) = </, w ( c ) + -L Va, / ' (e e wS «H/2_ e e wff W( U -^T-V 2 ) /2) 2^ 

2T i=l ^ 

and the second inequality of (99) indeed holds for all large T with probability 1. □ 
Proof of Lemma 9. The proof of Lemma 9 uses arguments similar to the proof of 
Lemma 6. The main changes are in replacing kT -1 / 2 by /cT -1 . Analogous to (98), there 
exists a constant C > such that with probability 1, 

P w (A t n A WkT -i) < P w \St + S t+(KT -i < 2c, sup (S u + S u+iliT -i) > 2c I 

1 «u<(+k7 1 - 1 ' 

(101) < C K T- 1 / 2 e- T ^ (c) 

for all large T, if k is chosen large enough. By (100) (with kT~ 1 / 2 replaced by kT _1 ), the 
following analogue to (99), 

(102) 4> w (c) > w (c) +7min{^T" 1 ,l} 

holds with probability 1 for some 7 > and hence Lemma 9 follows from (101) and (102). 
□ 
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